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Abstract: We compute explicit bounds in the normal and chi-square approximations of multi- 
. linear homogenous sums (of arbitrary order) of general centered independent random variables 
[ with unit variance. Our techniques combine an invariance principle by Mossel, O'Donnell and 
Oleszkiewicz with a refinement of some recent results by Nourdin and Peccati, about the approx- 
Qh" imation of laws of random variables belonging to a fixed (Gaussian) Wiener chaos. In particular, 
■ we show that chaotic random variables enjoy the following form of universality: (a) the normal 
. and chi-square approximations of any homogenous sum can be completely characterized and 
^ assessed by first switching to its Wiener chaos counterpart, and (b) the simple upper bounds 

' and convergence criteria available on the Wiener chaos extend almost verbatim to the class of 
homogeneous sums. These results partially rely on the notion of "low influences" for functions 
i -^H ■ defined on product spaces, and provide a generalization of central and non-central limit theorems 
. proved by Nourdin, Nualart and Peccati. They also imply a further drastic simplification of the 
^ . method of moments and cumulants - as applied to the proof of probabilistic limit theorems - 
[ and yield substantial generalizations, new proofs and new insights into some classic findings by 
' de Jong and Rotar'. Our tools involve the use of Malliavin calculus, and of both the Stein's 
• method and the Lindeberg invariance principle for probabilistic approximations. 
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1 Introduction 



1.1 Overview 

The aim of this paper is to study and characterize the normal and chi-square approxi- 
mations of the laws of multilinear homogeneous sums involving general independent ran- 
dom variables. We shall perform this task by implicitly combining three probabilistic 
techniques, namely: (i) the Lindeberg invahance principle (in the version developed by 
Mossel et al. [20] and Rotar' [ID], see also Mossel [S]), (ii) the Stein's method for the 
normal and chi-square approximations (see e.g. [HI [36l HH 112]), and (iii) the Malliavin 
calculus of variations on a Gaussian space (see e.g. [I6] and [2Z])- Our analysis will reveal 
that the Gaussian Wiener chaos (see Section [2] below for precise definitions) enjoys the 
following properties: (a) the normal and chi-square approximations of any multilinear 
homogenous sum are completely characterized and assessed by those of its Wiener chaos 
counterpart, and (b) the strikingly simple upper bounds and convergence criteria avail- 
able on the Wiener chaos (see [2T], [221 ESI EH ESI EHj) extend almost verbatim to the class 
of homogeneous sums. Our findings partially rely on the notion of "low influences" (see 
again [20]) for real- valued functions defined on product spaces. As indicated by the title, 
we regard the two properties (a) and (b) as an instance of the universality phenomenon, 
according to which most information about large random systems (such as the "distance 
to Gaussian" of non-linear functionals of large samples of independent random variables) 
does not depend on the particular distribution of the components. Other recent examples 
of the universality phenomenon appear in the already quoted paper [20], as well as in the 
Tao-Vu proof of the circular law for random matrices, as detailed in [13] (see also the Ap- 
pendix to [IS] by Krishnapur). Observe that in Section [7] we will prove analogous results 
for the multivariate normal approximation of vectors of homogenous sums of possibly 
different orders. 

1.2 The approach 

In what follows, every random object is defined on a suitable (common) probability space 
{Q,^,P). The symbol E denotes expectation with respect to P. We start by giving a 
precise definition of the main objects of our study. 

Definition 1.1 (Multilinear homogeneous sums) Fix some integers N,d ^ 2 and 
write [A^] = {!,..., A^}. Let X = {Xi : z ^ 1} be a collection of centered independent 
random variables, and let / : [A^]*^ — > M be a symmetric function vanishing on diagonals 
(that is, f{ii,...,id) = whenever there exist k ^ j such that ik = ij)- The random 
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variable 



Q,(iv,/,x) = grf(x) = Yl fiti,:.,id)x,,---x,^ (1.1) 

l^ii,...,id^N 

= d\ Yl /(^i,-,^d)X,, ■■■Xi, (1.2) 

{ii,...,ia}c[N]'i 

= d\ J2 f{ii.....id)Xi,---X,^ (1.3) 

li^ii<...<ia^N 

is called the multilinear homogeneous sum, of order d, based on / and on the first 
elements of X. As in the first equality of (11.11) . and when there is no risk of confusion, 
we will drop the dependence on and / in order to simplify the notation. Plainly, 
ElQdCX.)] = and also, if E{Xf) = 1 for every i, then 

E[Q,{Xr] = d\\\f\\l (1.4) 

where, here and for the rest of the paper, we use the notation 

11/11^ = E /'(^i'-'^^)- 

l^ii,...,ia^N 

In the following, we will systematically use the expression "homogeneous sum" instead of 
"multilinear homogeneous sum". 

Objects such as the RHS of (11.11) are sometimes called "polynomial chaoses", and 
play a central role in several branches of probability theory and stochastic analysis. For 
instance: 

- In the case d = 2, homogeneous sums are indeed standard quadratic forms with 
no diagonal terms. Limit theorems for quadratic forms have been the object of 
an intense study since 30 years. An exhaustive discussion about the best available 
results on the approximation of the laws of quadratic forms can be found in Gotze 
and Tikhomirov [9]. 

- By setting Xi = Hq.{Gi), where {Gj} is an independent and identically distributed 
(i.i.d.) centered standard Gaussian sequence and Hq is the Hermite polynomial of 
degree q, one obtains the basic building blocks of the so-called Gaussian Wiener 
chaos (see Section |2]), whose properties are at the core of the Malliavin calculus (see 
e.g. [II1I2Z]) and of the white noise analysis (see e.g. [HU]). 

- By setting Xi = Jq^{Pi), where Jq indicates the Charlier polynomial of degree q and 
{Pi} are the values of a centered Poisson measure over disjoint sets, one obtains a 
class of random elements generating the Poisson Wiener chaos (see e.g. [311 E21 ESI 

EH]). 
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- By setting X to be a Rademacher sequence (that is, the Xj's are i.i.d. and such 
that P{Xi = 1) = P{Xi = — 1) = 1/2) then Qd(X.) are the constitutive elements 
of the so-called Walsh chaos, playing a central role e.g. in harmonic analysis (see 
[T]), probability on Banach spaces (see [13]), quantum probability (see [IHl Ch. 2]), 
social choice theory and theoretical computer sciences (see [20], and the references 
therein, for an overview of results in these last two areas). 

Despite the almost ubiquitous nature of homogeneous sums, results concerning the 
normal approximation of quantities such as fll.ip in the non-quadratic case (i.e., when 
d ^ 3) are surprisingly scarce: indeed, to our knowledge, the only general statements in 
this respect are contained in the paper [6] and the monograph [5], both by P. de Jong 
(as discussed below, and in a different direction, general criteria allowing to assess the 
proximity of the laws of homogenous sums based on different independent sequences are 
obtained in [201 EHl SO]). 

Remark 1.2 We stress that (11.11) can be viewed as a sum of locaUy dependent random 
variables, with index set JT" = {l,...,n}'^. Note moreover that the techniques usually 
adopted in the normal approximation of sums of dependent variables are not appropriate 
here. For instance, one cannot adopt the approach developed by Chen and Shao in [2]: 
indeed, in our framework the neighborhoods of dependence 

= {j = e : |inj| 7^ 0}, i = (ii,...,id) G J, 

verify the relation UjgAi^j = i7 and are consequently "too large". 
In this paper, we are interested in controlling objects of the type 

d^iQ,{X) ■ Z}, 

where: (i) Qrf(X) is defined in (11. II) . (ii) Z is either a standard Gaussian ^(0,1) or a 
centered chi-square random variable, and (iii) the distance djt{F; G}, between the laws 
of two random variables F,G, is given by 

d,^^{F; G} = snp {\E[h{F)] - E[h{G)]\ : h G JT}, (1.5) 

{E denotes expectation) with Jif some suitable class of real-valued functions. 

Even with some uniform control on the components of X, the problem of directly and 
generally assessing d,^{Qd(X.) ; Z} looks very arduous. Indeed, any estimate comparing 
the laws of Qd(X.) and Z capriciously depends on the kernel /, and on the way in which 
the analytic structure of / interacts with the specific "shape" of the distribution of the 
random variables Xi. One revealing picture of this situation appears if one tries to evaluate 
the moments of Qd(X.) and to compare them with those of Z: see e.g. [33l B3] for a 
discussion of some associated combinatorial structures. In the specific case where Z is 
Gaussian, one should also observe that Qd(X.) is a completely degenerate U -statistic, as 
E[Qd{Xi; X2, . . . , Xd)] = for all X2, ■ ■ ■ , Xd, so that the standard results for the normal 
approximation of ^/-statistics do not apply. 



5 



The main point developed in this paper is that one can successfully overcome these 
difficulties by implementing the following strategy: ffist (I) measure the distance 

between the law of (5d(X) and the law of the random variable Qd{G)i obtained by re- 
placing X with a centered standard i.i.d. Gaussian sequence G = {Gi : i ^ 1}, then (II) 
assess the distance {Qrf(G) ; Z}, and finally (III) use the triangle inequality in order 
to write 

d^iQdm ; Z} ^ rf^{gd(X) ; g,(G)} + d^{Qd{G) ; Z}. (1.6) 

We will see in the subsequent sections that the power of this approach resides in the 
following two facts. 

- The distance evoked at Point (I) can be effectively controlled by means of the tech- 
niques developed in [20], where the authors have produced a general theory, based 
on a generalization of the so-called Lindeberg invariance principle (see e.g. |1S]), al- 
lowing to estimate the distance between multilinear homogeneous sums constructed 
from different sequences of independent random variables. A full discussion of this 
point is presented in Section H] below. In Theorem 14.11 we shall observe that, under 
the assumptions that E{Xf) = 1 and that the moments -EdXjl^) are uniformly 
bounded by some constant j3 > (recall that the Xj's are centered), one can deduce 
from [20] that (provided that the elements of Jif are sufficiently smooth). 



d,^{QdiX.);Q,iG)}^Cx max V Pit,t,, (1.7) 

\ / \<ii<rJ f ^ 



{i2,...,idMN]''-^ 



where C is a constant depending on d, (3 and on the class J^. The quantity 

Inf,(/)= Yl ^'(^'^2,...,^d) = 7^^ Yl f(^,^2,...,^d) (1.8) 

is called the influence of the variable i, and roughly quantifies the contribution of 
Xi to the overall configuration of the homogenous sum (5d(X). Influence indices 
already appear (under a different name) in the papers by Rotar' 



The random variable Qd(G) is an element of the dth Wiener chaos associated with 
G (see Section El for definitions). As such, the distance between Qd{G) and Z (in 
both the normal and the chi-square cases) can be assessed by means of the results 
appearing in [2Il[221[231[21[2Sl[2Hl[2niEl], which are in turn based on a powerful 
interaction between standard Gaussian analysis. Stein's method, and the Malliavin 
calculus on variations. As an example. Theorem 13.11 of Section [3] proves that the 
above quoted results imply that, if Qd{G) has variance one and Z is standard 
Gaussian, then 



d.^{QdiG) ; Z} ^ C^/\E[QdiG)^]-E{Z^)\ = C ^/\E[QdiG)^] - 3\ (1.9) 
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where C > is some finite constant depending only on and d. Moreover, if 
tlie elements of X have bounded fourth moments, then the RHS of fll.9p can be 
directly estimated by means of the fourth moment of Qd(X.) and some maxima over 
the influence functions appearing in (11.71) (see Section [5] for a full discussion of this 
point). 

1.3 Universality 

Bounds such as (11.61) . (11.71) and (II. 9p only partially account for the term "universality" 
appearing in the title of the present paper. Our techniques allow indeed to prove the 
following statement, involving vectors of homogeneous sums of possibly different orders, 
see Theorem 17. 5[ 

Theorem 1.3 (Universality of Wiener chaos) Let G = {Gi : i ^ 1} be a standard 
centered i.i.d. Gaussian sequence, and fix integers m ^ 1 and di,...,dm. ^ 2. For every 
j = 1, ...,m, let {{Nn \ fn'^) : 1} he a sequence such that {Nn^ : n 1} is a sequence 
of integers going to infinity, and each function fn^ : [Nn^Y^ — M symmetric and 
vanishes on diagonals. Define Qdj{Nn \ fn\G), n^l, according to (iXjp. Assume that, 
for every j = 1, ...m, the following sequence of variances is bounded: 

E[Qd^{Ni^\fi^\Gn n^l. (1.10) 

Let V be a m X m non-negative symmetric matrix, and let ^„(0, V) indicate a centered 
Gaussian vector with covariance matrix V. Then, as n —>■ oo, the following two conditions 
are equivalent. 

(1) The vector {Qdj{Nn \ fn \ G) : j = 1, m} converges in law to JKn{^, V)- 

(2) For every sequence X = {Xi : i ^ 1} of independent centered random vari- 
ables, with unit variance and such that supj-E|Xj|^ < oo, the law of the vector 
{Qdj{Nn\ fn\^) '■ j = l,...,m} converges to the law of .A^{0,V) in the Kol- 
mogorov distance. 

Remark 1.4 1. Given random vectors F = {Fi, F^) and H = {Hi, Hm), m ^ 1, 
the Kolmogorov distance between the law of F and the law of H is defined as 

dKoi{F,H)= sup \P{Fi ^ Zi,.. ., F^ ^ Z„,)-P{Hi ^ Zi,. .., Hm ^ Zm)\. {l.ll) 

(2l,...,2:„)eK™ 

Recall that the topology induced by dxoi on the class of all probability measures on 
M*" is strictly stronger than the topology of convergence in distribution. 

2. Note that, in the statement of Theorem 11.31 we do not require that the matrix V 
is positively definite, and we do not introduce any assumption on the asymptotic 
behavior of influence indices. 
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3. Due to the matching moments up to second order, one has that 

E [Qd. {N^^ ,f^\G)x Q,^ (iV(^-) Ji'\G)] = E [Q,^ {N^^ , /« , X) x Q,^ (iV^') , Z^') , X)] 

for every i,j = 1, ...,m and every sequence X as at Point (2) of Theorem II .Si 

Plainly, the crucial implication in the statement of Theorem 11.31 is (1) =^ (2), which 
basically ensures that any statement concerning the asymptotic normality of (vectors 
of) general homogeneous sums can be proved by simply focussing on the elements of a 
Gaussian Wiener chaos. Since Central Limit Theorems (CLTs) on Wiener chaos are by 
now completely characterized (thanks to the results proved in [22l [2ll [28l [29l El]), this 
fact represents a clear methodological breakthrough. As explained later in the paper, and 
up to the restriction on the third moments appearing at Point (2), we regard Theorem 
11.31 as the first exact equivalent - for homogeneous sums - of the usual CLT for linear 
functionals of i.i.d. sequences. The proof of Theorem [US] is completely achieved in Section 

m 

Remark 1.5 1. The use of the techniques developed in [20] makes it unavoidable to 
require a uniform bound on the third moments of X. However, one advantage of 
using Lindeberg-type arguments is that we obtain convergence in the Kolmogorov 
distance, as well as explicit upper bounds on the rates of convergence. We will see 
below (see Theorem 11.121 for a precise statement) that in the one- dimensional case 
one can simply require a bound on the moments of order 2 + e, for some e > 0. 
Moreover, still in the one-dimensional case and when the sequence X is i.i.d., one 
can alternatively deduce convergence in distribution from a result by Rotar' [101 
Proposition 1], for which the existence of moments of order greater than 2 is not 
required. 

2. In this paper we will apply the techniques of [20] in order to evaluate quantities 
of the type |£'[Qrf(X)'"] — £'[Qrf(Y)'^] |, where m ^ 1 is an integer and X and Y 
are two sequences of independent random variables. From a purely methodological 
standpoint, we believe that this technique may replace the (sometimes fastidious) 
use of "diagram formulae" and associated combinatorial devices (see [331 US])- See 
Section m for a full discussion of this point, especially Lemma F4.4[ 

1.4 The role of contractions 

The universality principle stated in Theorem 11.31 is based on [20], as well as on general 
characterizations of (possibly multidimensional) CLTs on a fixed Wiener chaos. Results 
of this kind have been first proved in [29] (for the one-dimensional case) and [3l] (for 
the multidimensional case), and make an important use of the notion of "contraction" 
of a given deterministic kernel (see also [2H])- When studying homogeneous sums, one is 
naturally led to deal with contractions defined on discrete sets of the type [N^, N ^ 1. 
As already pointed out in [25], these objects enjoy a number of useful combinatorial 
properties, related for instance to the notion of fractional dimension or fractional Cartesian 
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product (see Blei [T]). In this section, we shall briefly explore these issues, in particular by 
pointing out that discrete contractions are indeed the key element in the proof of Theorem 
11.31 More general statements, as well as complete proofs, are given in Section El 

Definition 1.6 Fix d,N 2. Let / : [N^ ^ M be a symmetric function vanishing of 
diagonals. For every r = 0, d, the contraction / t^t^ / is the function on [A^]^'^~^^ given 
by 

Ar/(jl,--,j2d-2r) (1-12) 

= ^ /(Ol, Or, jl, jd-r)/(ai, Or, jd-r+1, j2d-2r)- 
l^ai,...,a,-SCA'' 

Observe that is not necessarily symmetric and not necessarily vanishes on diagonals. 
The symmetrization of / / is written fi^rf- 

The following result, whose proof will be achieved in Section [3, as a special case of 
Theorem 17.5^ is based on the flndings of [SSj and [31] . 

Proposition 1.7 (CLT for chaotic sums) Let the assumptions and notations of The- 
orem I i . 3l prevail, and suppose moreover that, for every i, j = 1, ...,m (as n oo) 

E[QdXNfjli\G) X g,^,(iV(^-),/(^-),G)] ^ (1.13) 

where V is a non-negative symmetric matrix. Then, the following three conditions are 
equivalent, as n oo. 

(1) The vector {Qdj {Nn \ fn \ G) : j = 1, m} converges in law to a centered Gaussian 
vector with covariance matrix V . 

(2) For every J = l,...,m, E[Qd^{Ni'\ fi'\Gy] ^ 3V{z,i)^. 

(3) For every j = 1, m and every r = 1, dj - 1, \\fi^'' -kr fn\\2dj-2r 0. 



Remark 1.8 Strictly speaking, the results of [31] only deal with the case where V is 
positive deflnite. The needed general result will be obtained in Section [7] by means of 
Malliavin calculus. 

Let us now briefly sketch the proof of the implication (1) ^ (2) in Theorem 11.31 
Suppose that the sequence in fll.lOj) is bounded and that Point (1) in Theorem 11.31 is 
verifled. Then, by uniform integrability (use Proposition 12. 7p . the convergence fll.l3p is 
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satisfied and, according to Proposition 11.71 we have \\fn^ fri'\\2 — * 0. The crucial 
remark is now that 



In 112 



E 



i=l 



^ /^•)(z,^2,...,^.,)/l^■^A;,^2,...,^.,) 



(j) 



is;j2,...,^.s;A'^'' 



^ max 

i=l,...,Ni''> 



^ ^ /n 



i<i2,...,^.s;7v^'' 



(rf, -1)! max. Inf,(/(^) 



l<i<N, 



(recall formula (11.81) ). from which one immediately obtains that, as n — oo, 



max 



l<i<N, 



lnii{f^^^) — i> 0, for every j = 1, ...,m. 



:i.i4) 



:i.i5) 



The proof of Theorem ll.3l is concluded by using Theorem 17. 11 which is a multi-dimensional 
version of the findings of [ID]- Indeed, this result will imply that, if (11.151) is verified 
then, for every sequence X as in Point (2) of Theorem 11.31 the distance between the law 
of {Q,^{Ni'\fi'\G) : J = l,...,m} and the law of {Q,^{Ni'\ fi'\x) : j = l,...,m} 
necessarily tends to zero, and therefore the two sequences must converge in distribution 
to the same limit. 

As proved in [21], contractions play an equally important role in the chi-square ap- 
proximation of the laws of elements of a fixed chaos of even order. Recall that a random 
variable Z^, has a centered chi-square distribution with u ^ 1 degrees of freedom (noted 

~ X^i'^)) if Z,y ^= X]r=i i^i ~ l)' where [Gi, ...,G^) is a vector of i.i.d. centered 
Gaussian random variables with unit variance. Note that 

E{Zl) = 2u, E{Zl) = 8z/, and E{Z^) = Vlv^ + 48i/. (1.16) 

The following result is proved in [2Tj . 

Theorem 1.9 (Chi-square limit theorem for chaotic sums) Let G = {Gi : % ^ 

\\ he a standard centered i.i.d. Gaussian sequence, and fix an even integer d ^ 2. 
Let {Nn, fn : n ^ 1} be a sequence such that {Nn : n ^ 1} is a sequence of inte- 
gers going to infinity, and each fn '■ [NnY ^ ^ is symmetric and vanishes on diago- 
nals. Define Qd{Nn, fn,G) , n ^ 1, according to ( (i.ij) . and assume that, as n ^ oo, 
E[Qd{Nni fn, G)^] — > 2z/. Then, as n ^ oo, the following three conditions are equivalent. 
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(1) g,(iV„,/„,G)"A"Z,~x'(i.). 

(2) E[Qd{Nn, /„, G)^] - l2E[Qd{Nn, /„, G)^] ^ E[Zt] - l2E[Zl] = I2u^ - 48z/. 

(3) ||/„^d/2/n. - Q X /„||d ^ and \\fn*r fn\\2d-2r = for every r = 1, ...,d - 1 such 
that r 7^ d/2, where 

1 4 

Q := = 7). (1-17) 



A neat application of Theorem 11.91 is sketched in the next section. 



1.5 Example: revisiting de Jong criterion 

To further clarify the previous discussion (and to motivate the reader), we provide an 
illustration of how one can use our results in order to refine a remarkable result by de 
Jong, originally proved in [6]. 

Theorem 1.10 (See jB]) Let X = {Xi : i 1} be a sequence of independent centered 
random variables such that E{Xf) = 1 and E{Xf) < oo for every i. Fix d ^ 2, and let 
{Nn, fn : n ^ 1} be a sequence such that {Nn : n ^ 1} is a sequence of integers going 
to infinity, and each fn ■ [NnY ^ ^ is symmetric and vanishes on diagonals. Define 
Qd{n,X.) = Qd{Nn, fn,^) , 1, according to U.l\) . Assume that ^'[(^^(n, X)^] = 1 for 
all n. If, as n ^ oo, 

(i) E[gd(n,X)4] ^3, and 

(ii) maxi^i^^„ Y.i^i.,,...,i^^Nr. fnih id) = {d - 1)\ maxi^,^jv„ Inf^(/„) ^ 0, 

then Qd{n,^) converges in law to a standard Gaussian random variable Z ~ ^(0, 1). 

Remark 1.11 As shown in [6], the conclusion of Theorem II. 101 extends to homogeneous 
sums that are not necessarily multilinear. Also, the fact that E{Xf) < oo does not appear 
in [0] (however, an inspection of the proof of Theorem II.IUI shows that this assumption is 
indeed necessary). 

In the original proof given in [H], Assumption (i) in Theorem II.IUI appears as a con- 
venient (and mysterious) way of reexpressing the asymptotic "lack of interaction" be- 
tween products of the type ■ ■ ■ Xi^, whereas Assumption (ii) plays the role of a usual 
Lindeberg-type assumption. In the present paper, under the slightly stronger assumption 
that P := sViPj^E{Xf) < oo, we will be able to produce bounds neatly indicating the 
exact roles of both Assumption (i) and Assumption (ii). To see this, define according 
to fll.51) . and set Jif to be the class of thrice differentiable functions whose first three 
derivatives are bounded by some finite constant B > 0. In Section [5l in the proof of The- 
orem [5]T] we will show that there exist universal, explicit, finite constants Ci,C2,C3 > 0, 
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depending uniquely on /5, d and B, such that (writing G for an i.i.d. centered standard 
Gaussian sequence) 



dji^^{Qd{n,X);Qd{n,G)} ^ Ci x ( max Infi(/„) + / max Infi(/„) 




d^.{grf(n,G);Z} ^ C2X y/\E[Qd{n,Gy]-3\ ( 



(1.18) 
(1.19) 



( 




(1.20) 



In particular, the estimates fll.181) and fll.20p show that Assumption (ii) in Theorem 
11.101 ensures that both the laws and the fourth moments of Qd{n,X.) and Qd{n,G) are 
asymptotically close: this fact, combined with Assumption (i) implies that the LHS of 
( I1.19P converges to zero. This gives an alternate proof of Theorem 11.101 in the case of 
uniformly bounded fourth moments. 

Also, by combining the universality principle stated in Theorem 11.31 with (I1.19P (or, 
alternatively, with Proposition 11.71 in the case m = 1) and with [101 Proposition 1], one 
obtains the following "universal version" of de Jong's criterion. 

Theorem 1.12 Let G = {Xi : i ^ 1} he a centered i.i.d. Gaussian sequence with unit 
variance. Fix d ^ 2, and let {Nn, fn'-n^ 1} he a sequence such that {Nn : n ^ 1} is a 
sequence of integers going to infinity, and each fn '■ [NnY ^ ^ is symmetric and vanishes 
on diagonals. Define Qd{n,G) = Qd{Nn, fn,G) , n ^ 1, according to M.l\] . Assume 
that E[Qfi{n, G)^] ^1 as n ^ 00. Then, the following four properties are equivalent as 
n — > 00. 

(1) The sequence Qdi^, G) converges in law to Z ^ ^(0, 1). 



(3) For every sequence X = {Xi : i ^ 1} of independent centered random variahles 
with unit variance and such that supj < 00 for some e > 0, the sequence 
Qd{n,X.) converges in law to Z ^ -^(0, 1) in the Kolmogorov distance. 

(4) For every sequence X = {Xi : i ^ 1} 0/ independent and identically distrihuted 
centered random variahles with unit variance, the sequence Qd{n,X.) converges in 
law to Z ^ ^(0, 1) (not necessarily in the Kolmogorov distance). 

Remark 1.13 1. Note that at point (4) of the above statement we do not require 
the existence of moments of order greater than 2. We will see that the equivalence 
between (1) and (4) is partly a consequence of Rotar's results [40] . 

2. Theorem 11.121 is a particular case of Theorem 11.31 and can be seen as refinement of 
de Jong's Theorem 11.101 in the sense that: (i) since several combinatorial devices 



(2) 



E[Qd{n,Gf^ 
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are at hand (see e.g. [33]), it is in general easier to evaluate moments of multilinear 
forms of Gaussian sequences than of general sequences, and (ii) when the {Xi} are 
not identically distributed, we only need existence (and uniform boundedness) of 
the moments of order 2 + e. 

In Section [71 we will generalize the content of this section to multivariate Gaussian 
approximations. By using Proposition 11.91 and [iQl Proposition 1], one can also obtain 
the following universal chi-square limit result. 

Theorem 1.14 We let the notation of Theorem prevail, except that we now assume 
that d 2 is an even integer and E[Qfi{n, G)^] 2v, where v 1 is an integer. Then, 
the following conditions are equivalent as n oo. 

(1) The sequence Qdin, G) converges in law to Zy ~ X^i^)- 

(2) E[Q,{n, G)4] - 12E[Q,{n, Gf] ^ E{Zt) - l2E{Zl) = I2u^ - 48z/. 

(3) For every sequence X = {X^ : i ^ 1} of independent centered random variables 
with unit variance and such that supj E'lXjp"'"^ < oo for some e > 0, the sequence 
Qd(n,X.) converges in law to Z^. 

(4) For every sequence X = {Xi : i ^ 1} of independent and identically distributed 
centered random variables with unit variance, the sequence Qd{n,^) converges in 
law to Zy. 

1.6 Two counterexamples 

1.6.1 There is no universality for sums of order one 

One striking feature of Theorem 11.31 and Theorem 11.121 is that they do not have any 
equivalent for sums of order d = 1. To see this, consider an array of real numbers 
{/„(z) : 1 ^i ^n} such that ^"^^ f^{i) = 1. Let G = {G^ : z > 1} and X = {Xi : 
1^1} be, respectively, a centered i.i.d. Gaussian sequence with unit variance, and 
a sequence of independent random variables with zero mean and unit variance. Then, 
(5i(n, G) := Yl^=i fnii)Gi ~ o/(^(0, 1) for every n, but it is in general not true that 
Qi(n, X) := Yl^=i fn{i)Xi converges in law to a Gaussian random variable. An example 
of this situation is obtained by taking Xi to be non-Gaussian, /n(l) = 1 and fn{j) = for 
J > 1. As it is well-known, to ensure that Qi(n, X) has a Gaussian limit one customarily 
adds the Lindeberg-type requirement that maxi<jj^„ l/n(OI ~^ (see e.g. [Ill Th. 4.12]). 
A closer inspection indicates that the fact that no Lindeberg conditions are required in 
Theorem 11.31 and Theorem II. 121 is due to the implication (1) =^ (3) in Proposition 11.71 as 
well as to the inequality fll.l4p . 
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1.6.2 Walsh chaos is not universal 

We shall now prove that one cannot replace the Gaussian sequence G with a Rademacher 
one in the statements of Theorem 11.31 and Theorem I1.12[ Let X = {Xi : z ^ 1} be an 
i.i.d. Rademacher sequence, and fix d ^ 2. For every N d, consider the homogeneous 
sum 



It is easily seen that each QdiN, X) can be written in the form (11.11) . for some symmetric 
f = In vanishing on diagonals and such that (i!||/Ar||^ = 1- Since X1X2 ■ ■ ■ X^-i is 
a random sign independent of {Xj : i ^ d}, a. simple application of the Central Limit 

Theorem yields that, as ^ 00, Qd{N,X.) ^(0,1). On the other hand, a direct 
computation reveals that, for every N ^ d, maxi<gj<cAr Infj(/Ar) = (l/d!)^. This shows that 
homogenous sums associated with Rademacher sequences are not universal. For instance, 
by replacing X in (11.211) with a i.i.d. standard Gaussian sequence G = {Gi : z ^ 1}, one 
sees that, for every N ^ 2, 



Since (for d ^ 2) the random variable Gi ■ ■ ■ is not Gaussian, this yields that the 
sequence Qd{N, G), N 1, does not verify a Central Limit Theorem. 

1.7 Three caveats on generality 

In order to enhance the readability of the forthcoming material, we decided not to state 
some of our findings in full generality. In particular: 

(i) It will be clear later on that the results of this paper easily extend to the case of 
iiiGnite homogeneous sums (obtained by putting N = +00 in (II. ip ). This requires, 
however, a somewhat heavier notation, as well as some distracting digressions about 
convergence. 

(ii) Our findings do not hinge at all on the fact that N is an ordered set: it follows that 
our results exactly apply to homogeneous sums of random variables indexed by a 
general finite set. 

(iii) Our results on chi-square approximations could be slightly modified in order to 
accommodate approximations by arbitrary centered Gamma laws. To do this, one 
can use the results about Gamma approximations proved in [211 ES] . 




(1.21) 




(1.22) 
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2 Wiener chaos 



In this section, we briefly introduce the notion of (Gaussian) Wiener chaos, and point out 
some of its crucial properties. The reader is referred to [271 Ch. 1] or [TT], Ch. 2] for any 
unexplained definition or result. 



2.1 Definitions 

Let G = {Gi : z ^ 1} be a sequence of i.i.d. centered Gaussian random variables with 
unit variance. 

Definition 2.1 (Hermite polynomials and Wiener chaos) 1. The sequence of Her- 
mite polynomials {Hg : g ^ 0} is defined as follows: Hq = 1, and, for q ^ 1, 

Recall that the class {{qiy^^^Hg : q ^ 0} is an orthonormal basis of 

L\R, (27r)"i/V^'/2^x). 

2. A multi-index q = {qi : i ^ 1} is a sequence of nonnegative integers such that qi ^ 
only for a finite number of indices i. We also write A to indicate the class of all 
multi-indices, and use the notation |g| = J2i>i liy every g G A. 

3. For every d ^ 0, the dth Wiener chaos associated with G is defined as follows: 
Co = M, and, for d ^ 1, is the L^(P)-closed vector space generated by random 
variables of the type 

oo 

$(g) = Y[HgXG^), g G A and |g| = d. (2.23) 

i=l 

Example 2.2 (i) The first Wiener chaos Ci is the Gaussian space generated by G, 
that is, F G Ci if and only if 

oo 

F = XiGi, for some sequence {Aj : z ^ 1} G 

i=l 

(ii) Fix d,N ^ 2 and let / : [A^]'^ — M be symmetric and vanishing on diagonals. Then, 
the (i-homogeneous sum 

Qd{G)=d\ Yl f{iu...,id)G,,---G,,= /(^i,-,^d)Gi,---Gi, (2.24) 

{ii,...,id}C.[NY l^ii,...,id^N 

is an element of C^- 
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It is easily seen that two random variables belonging to Wiener chaoses of different 
orders are orthogonal in L^(P). Moreover, since linear combinations of polynomials are 
dense in L^(P, a{G)), one has that I/^(P, cr(G)) = 0(^>o Cd, that is, any square integrable 
functional of G can be written as an infinite sum, converging in and such that the 
dth summand is an element of Cd- This representation result is known as the Wiener-Ito 
chaotic decomposition of L^(P, (t(G)). 

It is often useful to encode the properties of random variables in the spaces Cd by 
using increasing tensor powers of Hilbert spaces (see e.g. [TT], Appendix E] for a collection 
of useful facts about tensor products). To do this, introduce a (arbitrary) real separable 
Hilbert space and, for d ^ 2, denote by i^®'^ (resp. ^®'^) the dth tensor power (resp. 
symmetric tensor power) of Sj; write moreover S)'^^ = = M and i^®^ = i^®^ = S^. Let 
{cj : j ^ 1} be an orthonormal basis of Sj. With every multi-index g G A, we associate 
the tensor e(g) G i^®!^! given by 

e{q) = el^'' ® ■ ■ ■ ®e,^^'^ 

where {gj^, gj^,} are the non-zero elements of q. We also denote by e(g) G i^®'''' the 
canonical symmetrization of e{q). It is well-known that, for every d ^ 2, the collection 
{e(g) : g G A, |g| = rf} defines a complete orthogonal system in ^®'^. For every d ^ 1 and 
every h G i^®*^ with the form h = J2qeA \g\=d^<i^il)y we define 

Idih)= ^«*(^)' (2-25) 

qeA,\q\=d 

where $(g) is given in fl2.23p . The following result (see e.g. [2Z1 p. 8] for a proof) 
characterizes Id as an isomorphism. 

Proposition 2.3 For every d 1, the mapping Id '■ ^®'^ Cd (as defined in Ii2.25\) ) is 
onto, and provides an isomorphism between Cd and the Hilbert space i^®*^, endowed with 
the norm \/d}.\\ ■ \\f,s)d. In particular, for every h, h' G ^®'^, 

E[Id{h)Id{h')]=d\{h,h')s,^a. 

As proved e.g. in [271 Section 1.1.2], if ^ = L'^{A, £/ , fi), with fi a-finite and non-atomic, 
then the operators Id are indeed (multiple) Wiener-Ito integrals. 

Example 2.4 (i) By definition, Ci = /i(ej), for every z ^ 1. 

(ii) The random variable Qd{G) defined in f l2.24p is such that 

Qd{G) = Id{h), where h = d\ Yl f{^l,...,^d)e^,®■■■0e,^e^'^''. (2.26) 

{H,...,ia}c[N]'' 
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2.2 Multiplication formulae and hypercontractivity 

The notion of "contraction" is the key to prove the general bounds stated in the forth- 
coming Sectional 

Definition 2.5 (Contractions) Let {e^ : i ^ 1} be a complete orthonormal system in 
Sj. Given / G Sj^^ and g G S)®'^, for every r = 0, . . . ,p A q, the rth contraction of / and g 
is the element of defined as 

oo 

f®rg= ^ {f,ei,(g) ...(g)ei^)sj<sr(g) {g,ei^(g) ...(g)ei^)sj<sr. (2.27) 

ii,...,ir=l 

Plainly, f ®o g = f ® g equals the tensor product of / and g while, for p = g, f ®p g = 
{f,g)fj<Dp- Note that, in general (and except for trivial cases), the contraction / 0^ is 
not a symmetric element of 5^®{p+'?-2r)_ rj^j.^^ canonical symmetrization oi f ®rg is written 

f®rg- 

Contractions appear in multiplication formulae like the following one (see [271 Prop. 
1.1.3] for a proof). 

Proposition 2.6 (Multiplication formulae) /// G S)®^ and g G i^®'', then 

W)h{9) = Q W2r.(M.^7). (2.28) 

Note that the previous statement implies that multiple integrals admit finite moments 
of every order. The next result (whose proof is given e.g. in [HI Th. 5.10]) establishes 
a more precise property, namely that random variables living in a finite sum of Wiener 
chaoses are hypercontractive. 

Proposition 2.7 (Hypercontractivity) Let d ^ 1 be a finite integer and assume that 
F G ®f=o Fix reals 2 ^ p ^ q < oo. Then, 

E[\F\'']-^ <: {q - 1)^ E[\F\P]p . (2.29) 

We will use hypercontractivity in order to deduce some of the bounds stated in the 
next section. 

3 Normal and chi-square approximation on Wiener 
chaos 

Starting from this section, and for the rest of the paper, we adopt the following notation 
for distances between laws of real- valued random variables. 
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- The symbol dTv{F,G) indicates the total variation distance between the law of F 
and G, obtained from fll.Sp by taking Jif equal to the class of all indicators of the 
Borel subsets of M. 



- The symbol dw{F,G) denotes the Wasserstein distance, obtained from (11.51) by 
choosing as the class of all Lipschitz functions with Lipschitz constant less or 



- The symbol dBw{F,G) stands for the bounded Wasserstein distance (or Fortet- 
Mourier distance), deduced from (ll.Sp by choosing as the class of all Lipschitz 
functions that are bounded by 1, and with Lipschitz constant less than or equal to 
1. 

While dKoi{F,G) ^ dTviF,G) and dBw{.F,G) ^ dw{F,G), in general dTv{F,G) and 
dw{F,G) are not comparable, see [8] for an overview. 

In what follows, we consider as given an i.i.d. centered standard Gaussian sequence 
G = {Gi : i ^ 1}, and we shall adopt the Wiener chaos notation introduced in Section [21 

3.1 Bounds on the normal approximation 

In the recent series of papers [22l [231126] . it has been shown that one can effectively combine 
Malliavin calculus with Stein's method, in order to evaluate the distance between the law 
of an element of a fixed Wiener chaos, say F, and a standard Gaussian distribution. In 
this section, we state several refinements of these results, by showing in particular that 
all the relevant bounds can be expressed in terms of the fourth moment of F. The proofs 
involve the use of Malliavin calculus and are deferred to Section [HI 

Theorem 3.1 (Fourth moment bounds) Fix d ^ 2. Let F = Id{h), h E S)®'^, he an 
element of the dth Gaussian Wiener chaos G^ such that E{F'^) = 1, let Z ~ ^(0, 1), and 
write 



equal to 1. 




2 

J=,®2{d-r) 



We have 



Ti(F) ^T2(F). 



(3.30) 



Moreover, the following bounds hold: 



1. 



dTv{F,Z)^2T,{F). 



(3.31) 
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2. 



dBw{F,Z)^dw{F,Z)<:Ti{F). 



(3.32) 



3. Let (/? : M — >• M 6e a thrice differentiable function such that \\(p' 



III 



< oo. Write 



oo 



K^{\^'m,\^"{o)iy"'\\oo) 

max |2 \v" (0)1 + ^y^^; 2 \v' (0)| + ^ ||^'"IL 
C,{d,\if'm,\ip"iO)iy"'m = {3 + i2V2f) xK,. 



(3.34) 



(3.33) 



Then, one has that 



E[^{F)]-E[^{Z)]\<:axT,{F). 



Proof. See Section E31 



Remark 3.2 If E[F) = and F has a finite fourth moment, then the quantity K4^{F) = 
Ei^F'^) — 3E{F^)'^ is known as the fourth cumulant of F. See e.g. [13] for a general 
introduction to cumulants. One can also prove (see e.g. [22]) that, if F is a non-zero 
element of the dth Wiener chaos of a given Gaussian sequence {d ^ 2), then ^^{F) > 0. 

3.2 Central limit theorems 

Now fix (i ^ 2, and consider a sequence of random variables of the type Fn = Id{hn), 
n ^ 1, such that, as n ^ cxd, E{F^) = 1. In [21] it is proved that the 

following double implication holds: as n — > cx), 

\\hn^rhn\\fj^2(d-r) 0, Vr = 1, d - 1 \\hn<^rhn\\^md-r) ^ 0, Vr = 1, d-1. 



Theorem 13. H combined with 03.351) . allows therefore to recover the following characteriza- 
tion of CLTs on Wiener chaos. It has been first proved (by completely different methods) 



Theorem 3.3 (See | I28[ , 129] ) Fix d ^ 2, and let Fn = Id{hn), n 1 be a sequence in the 
dth Wiener chaos ofG. Assume that lim^^oo -E'(-^n) = 1- Then, the following conditions 
are equivalent, as n ^ oo. 

(1) Fn converges in law to Z ^{0, !)■ 



(3.35) 



in m 



(2) E{F^)^E{Z')=3. 



(3) For every r = 1, d ~ 1, \\hn^rh. 




0. 
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Proof. Since sup„ E{F^) < oo, one deduces from Proposition 12 .71 that, for every M > 2, 
sup„ I -F„ I < oo. By uniform integrability it follows that, if (1) is in order, then 
necessarily E{F^) E^Z'^) = 3. The rest of the proof is a consequence of the bounds 
f l3.30p - fl3.3ip and of the fact that the topology induced by the total variation distance (on 
the class of all probability measures on M) is stronger than the topology of convergence 
in distribution. ■ 

The following (elementary) result is one of the staples of the present paper. It relates 
the Hilbert space contractions of formula fl2.27p with the discrete contraction operators de- 
fined in (11.121) . We state it in a form which is also useful for the chi-square approximation 
of the forthcoming Section 13.31 

Lemma 3.4 Fix d ^ 2, and suppose that h G S)®'^ is given by Ii2.26\) . with f : [A^]^ M 
symmetric and vanishing on diagonals. Then, for r = 1, ...,d — 1, 

\\h®r h\\f^s,(2d-2r) = ||/*r /||2d-2r, (3.36) 

where we have used the notation 1^1.12) . Also, if d is even, then for every ai,a2 G M 

\\ai{h ®d/2 h) + a2/i|U»d = ||ai(/ *d/2 /) + "2/11^- (3.37) 

Proof. Fix r = 1, d — 1. Using f l2.26p and the fact that {cj : j ^ 1} is an orthonormal 
basis of i3, one infers that 



'«2d-2r 



h®rh= ^ /(zi, ...,Zd)/(ji, jd)[ei, ® ■ ■ ■ ® e 

= ^ ^ f{ai,...ar,ki,...,kd-r)'X 

l^ai,...,ar^N l^ki,...,k2d-2r^N 

x/(ai, ...,ar, kd-r+1, k2d-2r)eki ®- ■ ■ ®ek, 
= /^r/(/^l,---,/^2d-2r)efci ® ■■■®efc2^_2^. (3.38) 

Since the set {e^j ® ■ ■ ■ ® e^j^ ,,^ : ki, /c2d-2r ^ 1, mutually distinct} is an orthonormal 
basis of 53®(2<^-2r-)^ Qj^g deduces immediately f l3.36p . The proof of f l3.37p is analogous. ■ 

Remark 3.5 Theorem 13.31 and formula (13.361) yield immediately a proof of Proposition 
11.71 in the case m = 1. 



3.3 Bounds on the chi-square approximation 

As demonstrated in [211 [22]. the combination of Malliavin calculus and Stein's method also 
allows to estimate the distance between the law of an element F of a fixed Wiener chaos 
and a (centered) chi-square distribution x^i'^) with u degrees of freedom. Analogously to 
the previous section for Gaussian approximations, we now state a number of refinements 
of the results proved in [211 [22] • In particular, we will show that all the relevant bounds 
can be expressed in terms of a specific linear combination of the third and fourth moments 
of F. Once again, the proofs are deferred to Section [HI 
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Theorem 3.6 (Third and fourth moment bounds) Fix an even integer d ^ 2 as 
well as an integer z/ ^ 1. Let F = Id{h) be an element of the dth Gaussian chaos Cd such 
that E^F"^) = 2u, let ~ X^(^^) > ^'^^ write 



UF) 



h 



d\' 



4(rf/2)! 



+d^ J2 (^-l)!'^!^ !) {2d-2r)\\\h®X^ 



r=l,...,d-l 



r — 1 



fj02(d-r) 



d-1 

3d 



\E{F^) - 12E(F3) - 12z/2 + 48z/|. 



Then, 



TsiF) ^ n{F) 



(3.39) 



and 




'2ti 1 2 

dBw{F. Z,) ^ max <( W — , - + — }> r3(F). 



(3.40) 



Remark 3.7 Observe that 



4{d/2)!3 a 



— , where Cd is given in (11.171) . 



Proof. See Section [831 



3.4 Chi-square limit theorems 

Now fix an even integer d ^ 2, and consider a sequence of random variables of the type 
Fn = Id{hn), n ^ 1, such that, as n ^ oo, E{F^) = d\\\hn\\1j^d 2u. In [21] it is proved 
that the foUowing double implication holds: as n ^ oo, 

\\hn®rhn\\^m{d-r) -> 0, Vr = 1, c/ - 1, T ^d/2 

\\hn®rhn\\fjiS2(d-r) 0,yr = 1, ...,d - 1, r ^ d/2. (3.41) 

Theorem 13.61 combined with (13.411) . allows therefore to recover the following character- 
ization of chi-square limit theorems on Wiener chaos. Note that this is a special case 
of a 'non-central limit theorem'; one usually calls 'non-central limit theorem' any result 
involving convergence in law to a non-Gaussian distribution. 

Theorem 3.8 (See |21] ) Fix an even integer d ^ 2, and let F„, = Id{hn), n ^ 1 be a 
sequence in the dth Wiener chaos of G. Assume that limn^ao E{F^) = 2u. Then, the 
following conditions are equivalent, as n ^ oo. 

(1) Fn converges in law to a Z^ X^{^)- 
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(2) E{F^) - 12E{F^) ^ E{Zt) - l2E{Zl) = I2v^ - ASv. 



(3) \\hn®d/2hn — Cd X hnWf^m — s> (for Cd as given in and, for every r = 1, d—1 

such that r ^ d/2, \\hn®rhn\\ f^mid-r) — > 0. 

Proof. Since sup„-E(-F'^) < oo, one deduces from Proposition 12.71 that, for every 
M > 2, sup„i?|F„|^^ < oo. By uniform integrability it follows that, if (1) holds, then 
necessarily E{F^) - 12E{F^) E{Z^) - l2E{Zl) = 12u^ - 48z/. The rest of the proof 
is a consequence of the bound fl3.39p and of the fact that the topology induced by the 
bounded Wasserstein distance (on the class of all probability measures on M) is equivalent 
to the topology of convergence in distribution. ■ 

Remark 3.9 By using fl3.37p in the case ai = 1 and a2 = —Cd, one sees that Theorem 
13.81 yields an immediate proof of Proposition 11.91 

4 Low influences and proximity of homogeneous sums 

We now turn to some remarkable invariance principles by Rotar' [40] and Mossel, O' Don- 
nell and Oleszkiewicz [20] • As already discussed, the results proved in [10] yield sufficient 
conditions in order to have that the laws of homogeneous sums (or, more generally, poly- 
nomial forms) that are built from two different sequences of independent random variables 
are asymptotically close, whereas in [20] one can find explicit upper bounds on the dis- 
tance between these laws. Since in this paper we adopt the perspective of deducing general 
convergence results from limit theorems on a Gaussian space, we will state the results of 
[10] and [20] in a slightly less general form, namely by assuming that one of the sequences 
is i.i.d. Gaussian. See also Davydov and Rotar' [1], and the references therein, for some 
general characterizations of the asymptotic proximity of probability distributions. 

Theorem 4.1 (See |20] ) Let X = {Xj, i 1} be a collection of centered independent 
random variables with unit variance, and let G = {Gi, i 1} be a collection of standard 
centered i.i.d. Gaussian random variables. Fix d 1, and let {Nn, fn : n ^ 1} be a 
sequence such that {Nn : n ^ 1} is a sequence of integers going to infinity, and each 
fn '■ [NnY ^ IR «s symmetric and vanishes on diagonals. Define Qd{Nn, fn,^) and 
Qd{Nn, fn, G) according to 

1. // supj^i -EflXip"*"^] < oo for some e > and z/maxi^j^7v„ Infj(/n) as n oo 
(with Infj(/„) given by then we have 

sn^\P[Qd{NnJn,^)^z\-P[Qd{NnJn,G)^z\\^Q asn^^x,. (4.42) 

2. If the random variables Xi are identically distributed and if maxi^i^N^ Infj(/„) 
as n ^ oo, then 

|^[^(Qd(iVn,/n,X))] - E[il,{Qd{NnJn,G))] \ ^0 OS n ^ OO, (4.43) 
for every continuous and bounded function : M ^ M. 
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3. If [3 := supj>]^ < oo then, for all thrice differentiable <^ : M ^ M such that 
II V^'" Hoc < oo and for every fixed n, 

\E[^{Qd{Nr.,fn,X))]-Ey{Q,{N^Jr„G))]\ (4.44) 

^ ||<^'"||oo (30/3)'^d! / max Inf,(/„). 

Proof. Point 1 is Theorem 2.2 in [20]. Point 2 is Proposition 1 in [ID]. Point 3 is 
Theorem 3.18 (under Hypothesis H2) in [20]. Note that our polynomials Qd relate to 
polynomials d\Q in [20], hence the extra factor of d\ in I4.44[ ■ 

Remark 4.2 We stress that Point 3 in Theorem 14.11 is a non-asymptotic result. In 
particular, the estimate fl4.44p holds for every n independently of the asymptotic behavior 
of the quantities maxi^j^7v„ Infj(/n)- 

In the sequel, we will also need the following technical lemma, which is a combination 
of several results stated and proved in |20j . 

Lemma 4.3 Let X = {Xj, i ^ 1} be a collection of centered independent random vari- 
ables with unit variance. Assume moreover that 7 := supj>i -E'[|Xj|''] < 00 for some q > 2. 
Fix N,d ^ 1, and let f : [N^ — > M 6e a symmetric function (here, observe that we do not 
require that f vanishes on diagonals). Define Qd(X) = Qd{N, f,X.) by Then 

Proof. Combine Propositions 3.11, 3.16 and 3.12 in [20]. ■ 

As already evoked in the Introduction, one of the key elements in the proof of Theorem 
14.11 given in [20] is the use of an elegant probabilistic technique, which is in turn inspired 
by the well-known Lindeberg's proof of the Central Limit Theorem (see [E]). Very lucid 
accounts of Lindeberg's ideas can be found in Trotter [lU] and Feller [71 Section VIII]. As 
an illustration of the Lindeberg-type approach developed in [20], we will now state and 
prove a useful lemma, concerning moments of homogeneous sums. As suggested before, 
we believe that the employed technique may replace "diagram formulae" and similar 
combinatorial devices, that are customarily used in order to estimate moments of non- 
linear functional of independent sequences (see e.g. [331113], as well as the estimates on 
moments of homogeneous sums given in de Jong's paper [6]). 

Lemma 4.4 Let X = {Xj : i ^ 1} and Y = {Yi : i ^ 1} be two collections of centered 
independent random variables with unit variance. Fix some integers N,d 1, and let f : 
[A^]*^ — > M &e a symmetric function vanishing on diagonals. Define (5d(X) = Qd^N, /, X) 
and Qd(Y) = Qd{N, /, Y) according to U.l\) . 

1. Suppose k ^ 2 is such that: (a) Xj and belong to L^{Q) for all i ^ 1; (b) 
E{Xl) = E{Y}) for alli^l and I e {2, . . . ,k}. Then (5d(X) and QdiY) belong to 
L\^), and EiQdi'X.y) = E{QdiYy) for all I e {2, . . . , k}. 
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2. Suppose m > k ^ 2 are such that: (a) a := max { supj^j^ -E|Xjp, supj^i -ElFil™} < 
oo; (b) E{Xl) = Eiyl) for all i ^ 1 and I G {2,..., A;}. Assume moreover (for 
simplicity) that: (c) ElQ^i^YY^"^ ^ M for some finite constant M ^ 1. Then 
Qd(X.) and (5d(Y) belong to L"^{Q) and, for all I ^ {k + 1, . . . , m}, 



\E{Qd{Xy) - E{Qd{Y 

< Cd,l,ra,a X M^" 



X max < max 

l<i<N I 



Inf,(/)^;Inf,(/)^i 



where Cd. 



l,m,a = i^d-iy. ^"^ (2V^ — l) 



Proof. Point 1 could be verified by a direct (elementary) computation. However, we 
will obtain the same conclusion as the by-product of a more sophisticated construction 
(based on Lindeberg-type arguments) which will also lead to the proof of Point 2. We 
shall assume, without loss of generality, that the two sequences X and Y are stochastically 
independent. For i = 0, . . . , N, let denote the sequence (Yi, . . . ,Yi, Xj+i, . . . , X^). 
Fix a particular i G {1, . . . , A^}, and write 



l<ii,...,Jd^Af 



/(2l,...,Z,)Z«...Z 



f{^u...,^d)Z^...Z?...Z^ 



l<H,...,id^Af 

3k: i^:=i 

where Z^^^ means that this particular term is dropped (observe that this notation bears no 
ambiguity: indeed, since / vanishes on diagonals, each string ii, ...,id contributing to the 
definition of Vi contains the symbol i exactly once). Note that Ui and Vi are independent 
of the variables Xi and Yi, and that Qd(Z(^-i)) = Ui + XtVi and Qd{Z^^) = Ui + YiVi. By 
using the independence of Xi and Yi from Ui and Vi (as well as the fact that E{Xl) = E{Y^) 
for all i and all 1 ^ / ^ A;) we infer from the binomial formula that, for / G {2, . . . , A;}, 



E[{a-, + x,Vi)'] = ^(J)i?(t/l-^'^^)i^;(x/) 

= E ([)EiUl~^VnE{Y/) = E[{U. + Y,Vy]. 



(4.45) 



i=o 



That is, E[Qd(Z(^-i))'] = E[Qrf(ZW)'] for all i G {1, . . . , A^} and / G {2,..., A;}. The 
desired conclusion of Point 1 follows by observing that QdCZ^^^) = QdpQ and Qd{'Zi^'^^) = 
Qd(Y). 
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To prove Point 2, let / G {/c + 1, . . . ,m}. Using fl4.45p and then Holder's inequality, 
we can write 



J2 C)EiUt'V^){E{X!)-E(y7)) 
i=k+l 

^ E C){E\u^\'y''^\m\y^'{E\x^\'+m\')■ 



j=k+l ^-"^ 

But, by Lemma 113] and since E{U!^) ^ E{Qd(X.f) ^ M^, we have 

E\Uit ^ «"'/'"(2v/r^)''E(f/^)'/2 ^ a'^'/™(2v/I^)''M'. 
Similarly, since E(y^^) = (i!^Infj(/) (see (11. 8p ). we have 

Hence, since E\Yi\^ + E\X,\^ ^ 2a^/"', 

|E[g,(z(*-i))'] -E[g,(zW)']| 



j=k+i 

dl 



a 



dl/m 



a 



{d~l)/m. 



^ 2'+ia^(2Vr^)'^''"'^rf!'M'"^-i X max 



Inf,(/)^;Inf,(/)^ 



Finally, summing for i over 1, . . . , and using that ^j^^ Inf j(/) 
\E[Qa{X)']-E[Q,{Y)']\ 



{d~l)\ ^ d\{d-l)\ 



yields 



N 



X max <; max 

l<i<N 



^ Cd,i,ra,a X M' "" ^ X max <; max 

l<i<N 



Inf,(/)"f^;Inf,(/)|-i]}5^Inf,(/) 
Inf,(/)"^';Inf,(/)^i 



5 Normal approximation of homogeneous sums 
5.1 Bounds 

The following statement provides an explicit upper bound on the normal approximation 
of homogenous sums, when the test function has a bounded third derivative. 
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Theorem 5.1 Let X = {Xi, i ^ 1} be a collection of centered independent random 
variables with unit variance. Assume moreover that sup^ E{Xf) < oo and let a : = 
max{3 ; supiE{Xf)}. Fix N,d ^ 1, and let f : [A^]'^ ^ M. be symmetric and van- 
ishing on diagonals. Define Qd(X.) = Qd(iV, /, X) according to U.l\) and assume that 
E[Q(i(X.y] = 1. Let ip : M. be a thrice differentiable function such that \\(p"'\\oo ^ B. 

Then, for Z ~ ^(0, 1), we have 



E[^{Q,{1L))] - E[^{Z)]\ ^ B{30PYd\ Jmax lnUif) (5.46) 

/ \ 1/4' 

ElQdCXy] - 3| + aV2 X 144^^-^ a^Vddl f max Infi(/) 



3d 

with defined as in Theorem \3.1\ 

Proof. Let G = be a standard centered i.i.d. Gaussian sequence. We have 

\E[ip{Qa{yi))\-E[ip{Z)\ \ ^81 + 62 (5.47) 

wii\i6i = \E[ip{Qa{X))]-E[p{Qd{G))]\a.nd52=\E[ip{Qa{G))\ - E[p{Z)\\. By The- 
orem O point 3, we have 5i ^ fi(30/3)'^rf! v^maxi ^j:^Ar Infj(/). By Theorem |3JJ and since 
E[g,(X)2] = E[Q,{Gf] = 1, we have 



52^cJ^\E[Q,{GY]-?,\ 



3d 

By Lemma [4.41 point 2 (with M = 1, k = 2 and I = m = 4) and since Infj(/) ^ 1 for all 
i, we have 



\E[Qd(Xy] - E [Qd{GY] I ^ 32 X 144^'^^^ a'^dd\\/ max Infi(/) (5.48 



so that 



ElQdCXy] - 3| + 4^2 X 144*^^^ a^Vddl max Infi(/) 



3d 

Finally, the desired conclusion follows by putting the bounds for 61 and 62 in (15.471) . 



Remark 5.2 As a corollary of Theorem 15. 11 we immediately recover de Jong's Theorem 
I1.10[ under the additional hypothesis that supj-E(X^^) < 00. 
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5.2 Converse statements: universality of Wiener chaos 



Here, we prove a slightly stronger version of Theorem 11.121 stated in Section 11.51 More 
precisely, we have the following result, where an additional condition on contractions 
(see assumption (3) in Theorem 15.31 just below and Definition 11.61) have been added with 
respect to Theorem 11.121 making the criterion more easily applicable in practice. 

Theorem 5.3 Let G = {Gi : i 1} be a standard centered i.i.d. Gaussian sequence, 
and fix an even integer d ^ 2. Let {Nn, fn : n 1} be a sequence such that {Nn} is a 
sequence of integers going to infinity, and each fn '■ [NnY —^^is symmetric and vanishes 
on diagonals. Define Qd{n,G) = Qd{Nn, fniG) , n ^ 1, according to U.l\) . and assume 
that E[Qa{n, G)^] — 1 as n ^ oo. Then, the following are equivalent as n ^ oo. 

(1) The sequence Qdin, G) converges in law to Z ^(0, 1). 

(2) E[g,(n,G)4] ^3. 

(3) For allr = l,...,d- 1, -k,. fn\\2d-2r 0. 

(4) For every sequence X = {Xi : z ^ 1} o/ independent centered random variables 
with unit variance and such that sup^ E\Xi\'^~^^ < oo for some e > 0, the sequence 
Qdin,^) converges in law to Z ^ ^(0, 1), in the Kolmogorov distance. 

(5) For every sequence X = {X^ : i ^ 1} o/ independent and identically distributed 
centered random variables with unit variance, the sequence Qd{n,^) converges in 
law to Z ^ ^(0, 1) (not necessarily in the Kolmogorov distance). 

Proof. The equivalences (1) -v^ (2) (3) are a mere reformulation of Theorem 13.31 
deduced by taking into account the equality f l3.36p . On the other hand, it is trivial that 
each one of conditions (4) and (5) implies (1). So, it remains to prove the implication 
(1), (2), (3) (4), (5). Fix ^ e M. We have 

|P[g,(n,X) ^ ^] - P[Z ^ ^] I ^ 6^:\z) + 6'^\z) 

with 

6i^\z) = \P[Qd{n,X)<:z]-P[Qd{n,G)^z]\ 
6^J:\z) = \P[Qd{n,G)^z]-P[Z ^z]\. 

By assumption (2) and f l3.30p - fl3.3ip . we have snp^^^6n\z) 0. By combining assump- 
tion (3) (for r = d — 1) with fll.14p - fll.15p . we get that maxi<jj^7v„ Infj(/n) — > as n ^ cxo. 
Hence, Theorem 14.11 (Point 1) implies that sup^^]^6li\z) — > 0, and the proof of the impli- 
cation (1), (2), (3) =^ (4) is complete. To prove that (1) =^ (5), one uses the same line of 
reasoning, the only difference being that we need to use Point 2 of Theorem 14.11 instead 
of Point 1 . ■ 
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5.3 Bounds on the Wasserstein distance 

The following statement shows that our techniques allow to directly control the Wasser- 
stein distance between the law of a homogenous sum and the law of a standard Gaussian 
random variable. 

Proposition 5.4 As in Theorem 15. iL let X = {Xj, i ^ 1} be a collection of centered 
independent random variables with unit variance. Assume moreover that supj E{Xf) < oo 
and note a := max{3 ; supj Fix N,d ^ 1, and let f : [N^ ^ M. be symmetric 

and vanishing on diagonals. Define Qd(X.) = Qd{N, f,^) according to U.l\) and assume 
that E[Qd(X)2] = 1. Put 



B, = 2(30/3)'^d! /max Infi(/) 



and 



' d — 1 

B2 = 3x (3 + (2^2)'^) a/— ^ X 



3d 



X 



E[QdiX.Y] -3| +aV2x UA'^-^ a^y/dd\ f max Infi(/) 



Then, for Z ~ ^(0, 1), we have dwiQdi^), Z) ^ A{Bi + ^2)^ provided Bi + B2 ^ 

Proof. Let h G Lip(l) be a Lipschitz function with Lipschitz constant 1. By Rademacher's 
theorem, h is Lebesgue-almost everywhere differentiable. So, if we denote by h' its deriva- 
tive, then ||/i'||oo ^ 1- For t > 0, define 

POO 

ht{x) = / h{\/iy + Vl - tx)(t){y)dy, 



where denotes the standard normal density. The triangle inequality gives 

\E[h{Qd{X.))]-E[hiZ)]\ 
^ \E[ht{Qd{X.))]-E[ht{Z)]\ + \E[hiQdi'X.))]-E[ht{Qd{'X.))]\ + \E[h{Z)] - E[htiZ)]\. 

Now, let us differentiate and integrate by parts, using that (p'{x) = —x(f){x), to get 
h'lix) = / yh\Viy + VT^x)(j){y)dy. 



Vt 

Hence, for < t < 1, we may bound 

1 -t r°° 1 

\\h';\\^ ^ -^\\h'\Uj \y\^[y)dy^—. (5.49) 
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For < t ^ I (so that \/t ^ - 1), we have 
\E[h{Q,{X))] - E[ht{Qd{Xm 



E 



h{Viy + Vr^tQ^ix)) - hiVT^QdCx.))} Hy)dy 

+E [\h{VT^tQd{X)) - h{Q,{X))\] 

/oo 
\y\(j){y)dy+ \\h'\\^ 
-oo i- — t 



^ .E[mx)\]^^v~t. 



Similarly, \E[h{Z)] - E[ht{Z)]\ ^ |v^. 

We now apply Theorem 15 .![ To bound C*, we use that |/it(0)| ^ 1 and that |/i"(0)l ^ 
t~2; also ||/i'/'||oo ^ 2/t (as it can be shown by using the same arguments as in the proof 
of (15.491) above). Hence, as 2 ^ j and \/2 ^ t~2, we have 



a = (3 + {2V2y) X JC ^ (3 + {2V2Y) X max |2t^5 + 

^ (3 + (2v^)'^) X J. 

V 

Due to II /i'/' II 00 ^ 2/t, Theorem 15.11 gives the bound 

\E[ht{Q,{X))]-E[ht{Z)]\ ^ 3V~t+{B^ + B2)^. 



2 + -r^ 

3 



Minimising + {B\ + -82)7 in i gives that t = (|(-Bi + -B2)) • Plugging in the values 
and bounding the constant part ends the proof. ■ 



6 Chi-square approximation of homogeneous sums 
6.1 Bounds 

The next result provides upper bounds on the chi-square approximation of homogeneous 
sums. 

Theorem 6.1 Let X = {Xi, i ^ 1} be a collection of centered independent random 
variables with unit variance. Assume moreover that supj £'(X^') < 00 and note a : = 
max{3; supj £'(Xj^)}. Fix an even integer d ^ 2 and, for N ^ 1, let f : [N^ ^ M be 
symmetric and vanishing on diagonals. Define Qd{X) = Q^iN, f,X) according to ( fi. il) 
and assume that ElQ^iX)"^] = 2v for some integer v ^ 1. Let (/? : M ^ M 6e a thrice 
differentiable function such that ||v?||oo ^ 1> llv'lloo ^ 1 one? ||v5'"||oo ^ B. Then, for 
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X^(z^), we have 

\E[cpiQ,i^))]-E[cpiZ,)] 



^ B(30pYd\J max InUf) + max 




27r 1 _^ 2 



X 



rf- 1 

3d 



E[g,(X)4] - 12E[Q,(X)3] - 121.2 + 481 



+4v^d! (^V2 X 144'^-^ + v^(2v^)^^a^) f max Infi(/) 



1/4 



Proof. We proceed as in Theorem I5.1[ Let G = (6*4)1^1 denote a standard centered 
i.i.d. Gaussian sequence. We have |i?[v9(Qd(X))] — E^{Zy^ | ^ (5i + ^2 with 

h = \E[^{Q,m)] - E[^{Q,{G))]\ 



82 = \E[p{Qd{G))\-E[^{Z,) 
By Theorem 10 (Point 3), we have 5i ^ B{?>Ql3Yd\^/msxi ^i^N Iiif j (/) . By Theorem 13.61 
we have, with C# = max | -y/^, + ^ | • 



Additionally to ([53H1), we have, by Lemma (Point 2, with M = y/2u, k = 2, I = 3 
and m = 4): 



\E[Qdi^Y] - E[Q,iGf] \ ^ 16z/(2V2)3(2'^-i)axrfrf! /max inf,(/). 



Hence 



ElQdCXY] -12E[g,(X)3] -12z/2 + 48z/| 



+4:Vdd\ (^V2 X 144^^"^ + (2v^)^^^a^) f max Infi(/) 



1/4 



The desired conclusion follows. ■ 

As an immediate corollary of Theorem 16. we deduce the following new criterion for 
the asymptotic non-normality of homogenous sums - compare with de Jong's Theorem 
[TIOl 

Corollary 6.2 Let X = {Xj : i ^ 1} be a sequence of independent centered random 
variables with unit variance such that sup^ E{Xf) < 00. Fix an even integer d ^ 2, and 
let {Nn, /„ : n ^ 1} be a sequence such that {iV„ : n 1} is a sequence of integers going 
to infinity, and each fn '■ [NnY — >■ K symmetric and vanishes on diagonals. Define 
Qd{n, X) = Qd{Nn, fn, X) according to ^J\). If as n 00, 
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(i) E(Q,(r2,X)2) ^2z/, 

(ii) E[Qd{n,lLY] - 12E[g,(iV„,/„,X)3] ^ 12Z/2 -48i/, and 

(iii) maxi^i^AT^ Infi(/„) ^ 0, 

i/ien Qrf(n, X) converges in law to Zy ~ '^[y)- 

6.2 A converse statement 

The following statement contains a universal chi-square limit theorem result: it is a general 
version of Theorem I1.14[ 

Theorem 6.3 Let G = {Gi : i ^ \\ he a standard centered i.i.d. Gaussian sequence, 
and fix an even integer d ^ 2. Let {Nn, fn'-n'^l}bea sequence such that {Nn} is a 
sequence of integers going to infinity, and each fn : [A''„]'^ ^ M. is symmetric and vanishes 
on diagonals. Define Qd{n,G) = Qd{Nn, fn,G) , n^l, according to M.l\) . and assume 
that E[Qd{n, G)^] 21/ as n ^ oo. Then, the following are equivalent as n ^ oo. 

(1) The sequence Qd{n, G) converges in law to ~ yi^iy)- 

(2) E\Qd[n, GY\ - \2E\Qd[n, Gf\ ^ \2v'' - 48z/. 

(3) ll/n^d/2/n - Crf X ^ (for Q dcfncd m l\l.r/^ ) and, for every r = 1, d-1 
such that r ^ d/2, -kr /„||2d-2r ^ 0. 

(4) For every sequence X = {Xj : i ^ 1} of independent centered random variables 
with unit variance and such that supj -ElXjp"*"^ < oo for some e > 0, the sequence 
Qd{n,X.) converges in law to Z^ ~ idW)- 

(5) For every sequence X = {Xj : i 1} of independent and identically distributed 
centered random variables with unit variance, the sequence Qd{n,^) converges in 
law to Zy. 

Proof. The proof follows exactly the same lines of reasoning as in Theorem 15.31 Details 
are left to the reader. Let us just mention that the only differences consist in the use 
Theorem EIH] instead of Theorem EISl and the use of (IX^ - dCTD instead of fraiD - (IH3TD . 

■ 

7 Multivariate Extensions 
7.1 Bounds 

We recall here the standard multi-index notation. A multi-index is a vector a G {0, 1, . . 
We write 

\a\=Y,a„ a\ = l[a,\, d, = —, 9" = 9- . . . = JJ^^- 

j=i j=i j=i 

31 



Note that by convention, 0*^ = 1. Also note that = y", where Uj = \xj\ for all j. 
Finally, for ip : M regular enough and A; ^ 1, we put 

||y?('')||oo = max^ sup \d°'ip{z)\. 

\a\=k al zGR"^ 

First we derive a multivariate version of fl4.44l) : see also [121, Theorem 4.1] for a similar 
statement. 

Theorem 7.1 Let X = {Xj, i ^ 1} be a collection of centered independent random 
variables with unit variance and such that P := supj^;^ < oo. Let G = {Gi : i ^ 1} 

be a standard centered i.i.d. Gaussian sequence. Fix integers m ^ 1 and dm ^ ■ ■ ■ ^ 
di ^ 1 as well as Ni, . . . , Nm ^ 1. For every j = l,...,m, let fj : [NjY^ W be a 
symmetric function vanishing on diagonals. Define Q^{G) = Qd {Nj, fj, G) and Q^(X.) = 
Qd,{NjJj,X.) according to (EJP, and assume that E[Q^{Gf] = E[Q^{X.f] = 1 for all 
j = 1, . . . , m. Assume that there exists a G > is such that Yl^i^ maxi^j^^ Infj(/j) ^ 
G. Then, for all thrice differentiable ip : M"* M with ||</?"'||oo < oo, we have 



E[ip{Q\X), . . . , Q^Oq)] -E[ip{Q\G), . . . , g™(G))] 

1 ^ 

^(16V2/?)^d 



/5 



Li=i 



max max Infj(/j^ 



In the one-dimensional case (i.e. m = 1), y^^^^ '^^ maxi^j^mlnfj(/j) = [d\{d — 1)!]~^, so 
we can choose G = [d\{d — . In this case, when (3 is large the bound from Theorem 
17.11 essentially differs from (14.441) by a constant times a factor d. 

Proof. Abbreviate 

Q(X) = (Qi(X),...,g'"(X)), 



and define analogously Q(G). We proceed as for Lemma [4.41 with similar notation. For 
i = 0, . . . , maxj Nj, let Z(*) denote the sequence (Gi, . . . , Gi, Xi^i, . . . , ^maxj Nj^ ■ Using 
the triangle inequality. 



E[^(Q(X))] -E[^(Q(G))] 



J2 i?[v'(Q(z(-i)))]-i5;[y.(Q(z«))] 



i=l 



Fix a particular i G {1, . . . , max^ Nj}, and write, for j = 1, 



U 



U) 



l^ii,...,id^Nj 
3k: ik=i 
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where means that this particular term is dropped. We write Uj = {IJY\ ■ ■ ■ 
and Vj = (V^''^'*, . . . , V^"^^). Note that Uj and Vj are independent of the variables Xi and 
Gi, and that Q(Z(^-i)) = + XiVi and Q(Z(*)) = + GiVi. By Taylor's theorem, 
using the independence of from Uj and Vj as well as E{Xi) = and E{Xf) = 1, we 
have 



(1) 



E[^{V, + X,V,)] - J2 ^E{^MV^)yt)E{xt\: 



J2 sup \dM^)\Em')E{m) ^ y'WooEm') ^ Ei\vt\). 

|a|=3 



l«l=3 



Similarly, 



E[V'(U. + G.V,)] - E lE(9V(U.)V?)i?(G!" 

|a|sC2 

V vr a! ^gRm V vr 



|a|=3 " |«|=3 

Due to the independence and the matching moments up to second order, we obtain that 



/ |a|=3 



Abbreviate Tj = maxi^^^m Inf Next we use that, for j = 1, . . . ,m, by Lemma [4.31 
(with g = 3), we have 



Thus 



E^K^^ 

|a|=3 



j,fc,«=i j,fc,/=i 



(Oi3a 



E^d^^ 



(i)i3a , / 



E(16V2/3)^rf,! 

.i=i 



3/2 



Collecting the bounds, summing over and using that Y.i=V '^i^C gives the desired 
result. 



The next theorem associates bounds to the normal approximation of the vector 

(gi(x),...,g-(x)). 
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Theorem 7.2 Let X = {Xi : i 1} be a collection of centered independent random 
variables with unit variance. Assume moreover that (3 := supj < oo. Fix integers 

m^l, d^^ ...^ di^2 andNi,...,Nm>l- For every j = 1, m, let fj : [NjY^ R 
be a symmetric function vanishing on diagonals. Define Q^(X.) = Qa {Nj, fj, X) according 
to and assume that E[Q^(X.) ] = 1 for all j = 1, . . . ,m. Let V be the m x m 

symmetric matrix given by V{i,j) = E^Q''(X.)Q^ (X.)'j. Let C be as in Theorem \7.1\ Let 
if : M.^ ^ M. be a thrice differentiable function such that ||v5"||oo < oo and ||(^'"||oo < oo. 
Then, for Zy = {Zy,...,Zy) ~ c/Kn{0,V) (centered Gaussian vector with covariance 
matrix V ), we have 



|E[^(gi(x),...,g™(x))] ^ 5^A,, + 2 

\i=l lsC't<j^m 
1 3 

/ lR\ JZ. _ d,--l 



-cy" 



for Aij given by 



dj_ 
V2 



r=l ^ 



r — 1 



Y^{IQV2(3)^ d^. 



max max Infjff,), 



{di + dj — 2r)\ (ll/j -kdi-r /i|U®(2r) + Wfj -^dj-r /ilU®(2'-)) 



+'i-{d,<dj}\l dj\ [ J ) Wfj -kd^-d, /ilU®2d,. 



(7.50) 



Proof. The proof is divided into several steps. 

Step 1: Reduction of the problem. Let G = (6*4)4^1 be a standard centered i.i.d. 
Gaussian sequence. We have 



with 



\E[p{Q\y.),...,Q^{X))\-E[p{Zy)\\^ 5^ + 52 



51 = |E[^(Qi(X),...,Q'"(X))] -E[^(Qi(G 

52 = \E[^{Q\G),...,Q^{G))]-E[^{Z 



.Q™(G))] 



(7.51) 



Step 2: Bounding 61. By Theorem 17.11 we have 

m 



max max IniAfi). 



Step 3: Bounding 62- We wiU not use the resuh proved in [21], since here we do not 
assume that the matrix V is positive definite. Instead, we will rather use an interpolation 
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technique d la Talagrand [S] . Without loss of generahty, we assume in this step that Zy 
is independent of G. Also, for any j = 1, . . . ,m, observe that Q^{G) = Idj{hj) where 



see also (ESE]). For t G [0, 1], set 

^(t) = E[ip{Vl^t{h,{h^), . . .Jdjh^)) + VtZv)], 

so that 

^2 = 1^(1) - ^(0)| ^ sup 

te(o,i) 

We easily see that \E' is differentiable on (0, 1), and that ^''(t) equals 
By integrating by parts, we can write 



E 
E<E 



^{^T^tilM, h^ihm)) + ViZv)Zl 



dip 
dxi 



{VT^z + VtZv)Zl 



™ ( r q2 

j=l I ^ J 



\z = (Ia^(h^),...,Ii^(hm)) , 



dxidxj 



By using e.g. fl8.6ip in order to perform the integration by parts, we can also write 

dip 



E 



dx. 



E < E 



dip 



\z = Zy 



m 



ay 

dxidxj 



{VT~t{I,,{h), . . . ,/,,„(M) + Vtz){D[I,Xh,)],D[I,^{hj)])^ 



d'ip 
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Hence "^'(t) equals 

2 E ^ Qr^{VT^tii,,ih),...,i,jhj) + VtZy) (V{t,j) - -{D[I,Xh^)],D[I,^ih,)]), 

i,j=i L « i \ « 

so that we get: 

m 

S2 < ywooY,^ 



E 



V{i,j)-j{D[hXK)],D[h^{h,)])^ 



Step 4-' Bounding Var((iI)[Jrf.(/ij)], ^[/^^(/ij)])^). Assume, for instance, that i ^ j. 
We have 

{D[I,Xh)],D[I,^ih,)])^ 

= didj / Id^_i[hi{-,a))ld^_i[hj{-,a))da 
Jr 



d^dj / Y 



r=0 



di — \ \ f d^ — \ 



d^dj / Yl 



dj — l\ /dn — 1 



di~l 



didj Y^ ^' 



r=0 



di — l\ fdn — 1 



didj Y^ ^' 



r=0 



di — l\ fdn — 1 



Id,+dj-2-2r{hi{-,a)®rhj{-,a))da by (E2 

Id,+dj-2-2T{hi{-,a) ®r hj{-,a))da 

Id,+dj-2-2r{ I hi{-,a) (^r hj{-,a)da) 
Jr 

Idi+dj-2~2r{hi ®r+l hj) 



did, |^(r - 1)! (^^_ (^^_ ^yd,+d,-2r{hi ®. h,) 



dd,Y^r-l)\ 



r=l 



di — 1\ /di — 1 



r — 1 / V r — 



^ \ld,+dj-2r{hi®rhj). 



Hence, if di < dj, then Yar [{D[IdXhi)], D[Id^{hj)])sj) equals 



i^di + dj '^'l^^^-\\hi®rhj\\ ^(Q(di+dj~2r) 
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while, if di = dj, it equals 



r=l 



^ j (2(ij — 2r)! II /ij^r^j ||^(g)(2dj-2r) • 



Now, let us stress the two following estimates. If r < rfj ^ dj then 

||/ij®r^j||^8(di+dj-2r) ^ II ®r ||^(»(dj+£ij-2r) = {hi ® di-r ll j ® ^. _^ H j) ^®^r 

^ ll^i ®di-r hiWfjmr^hj ®dj-r hj\ 



\hi 0di-r hi\\'^02r + \\hj ®dj-r ^il|^®2r) . 



\i r = di < dj, then 



■j \\^e)(dj-di) 



^ Whi 



By putting all these estimates in the previous expression for Var ((/^[/^.(/li)], D[Idj{hj)])sjj . 
we get, using also (13.361) . that 



di 



Vcir{{D[IdM],D[Id^{h,)])^) <: A,,, 



for Ay defined by fl73UD . 

This completes the proof of the theorem. ■ 

We now translate the bound in Theorem 17. 21 into a bound for indicators of convex sets. 



Corollary 7.3 Let the notation and assumptions from Theorem 7^ prevail. We consider 
the class 7Y(M'") of indicator functions of measurable convex sets in M™. Let 



Bo 



and 



i=l 



1 5C j <j 

m 



n 3 



.i=i 



max max Infjff/ 



1. Assume that the covariance matrix V is the m x m identity matrix Im- Then 

sup \E[h{Q\X.),...,Q'^{X))]-E[h{Zv)]\ ^ 8(Si + 52)3mi. 
/ieW(K'") 



2. Assume that V is of rank k ^ m, and let A = diag{Xi, . . . , X^) be the diagonal 
matrix with the non-zero eigenvalues of V on the diagonal. Let B be amx k column 
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orthonormal matrix (that is, B^B = 1^ and BB^ = 1^), such that V = BAB^, 
and let 



b = max (A-^B^) . (7.52) 

i,j V / i,j 



Then 

sup \E[h{Q\X.),...,Q"'{X.))]-E[h{Zv)]\ ^ Sib^B. + b^B^f^ml 

Remark 7.4 1. We note that 

sup \P[{Q\X),...,Q^{X)) <: z] - P[Zv ^ z]\ 
< sup \E[h{Q\X),...,Q"^{X))]~E[h{Zv)]\. 

Thus Corollary 17.31 immediately gives a bound for Kolmogorov distance. 

2. By using the bound for 62 derived in the proof of Theorem 17.21 above, and fol- 
lowing the same line of reasoning as in the proof of Corollary 17. 3^ we have, by 
keeping the notation of Theorem 17.21 that if Aij for all 2, j = 1, . . . ,m and 

maxi^cj^m maxi^i^cAT^, Infi(/j) 0, then 

[Qd, (iVi, /i, G), . . . , Qa^iNm, fm, G)) ^ ^^(0, V^), as iVi, . . . , N, ^ 00, 

in the Kolmogorov distance. 

Proof. First assume that V is the identity matrix. We partially follow [37], and let $ 
denote the standard normal distribution in M™, and the corresponding density function. 
For h G 7i(M™'), define the following smoothing: 



ht{x) = h (Vty + Vl - tx\ ^{dy), < t < 1. 



The key result, found for example in pUl Lemma 2.11], is that, for any probability measure 
Q on M™, for any W ^ Q and Z ^ ^, and for any < t < 1, we have: 



sup \E[h{W)]-E[hiZ)]\^^ 
Similarly as in [15l page 24], put 



sup \E[ht{W)] - E[ht{Z)] I + 2^Vt 



. (7.53) 



uix,t,z) = {2ntrfexp[-J2— 



2t 

i=l 
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and observe that u{x,t,z) is the density function of the Gaussian vector Y ~ ^[0,tlm) 
taken in 2; — \/l — tx. Substituting z = \fty + \J\ — tx, we get, for < t < 1, 

ht{x) = / h{z)u{x,t, z)dz. 

By dominated convergence, we may differentiate under the integral and obtain, for the 
second derivatives, 

d'^ht , . l-t/", l-t 



(9x? t ./mm 

and, for z 7^ j, 

d'^h , . l-t 



h{z)u{x,t, z)dz -\ — / h{z){zi — ^/l — txifu{x, t, z)dz 



* [X] 



dxidxj t"^ 
For the third partial derivatives 



h{z){zi — a/1 — txi){zj — y/l — txj)u{x, t, z)dz. 



d'ht, , (l-t) 



(l-t) 



x) = —3 / h(z){zi — Vl — tXi)u{x, t, z)dz 



H — / h{z){zi — Vl — txif'u{x,t, z)dz 



and, for i 7^ j. 



d^ht , , (l-t) 



dxfdxj t'^ 



x) = / h{z){zj — Vl — txj)u{x, t, z)dz 



t3 

and, for i, j, k all distinct. 



/ h{z){zi — Vl — tXiY{zj — \/l — tXj)u{x, t, z)dz, 



dxidxjdxk 
(l-t) 



[X) 



3 
|2 



/ h{z){zi — Vl — tXi){zj — Vl — tXj){zk — \/l~-tXk)u{x,t, z)dz. 



t3 

Because ^ /i(z) ^ 1 for all z G M"\ we may bound 



dx'i 



2 



t t2 ^ * ^ t 



Similarly, for i ^ 



dxidx 



t2 "U^''J"L,^^U 
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Thus, we have ||oo ^ l/'^ ^ l/t'^^^. Bounding the third derivatives in a similar fashion 
yields, for all i,j,k not necessarily distinct. 



dxidxjdxk 



<: max {3 E[\n] t + E[\Yn Em] t + |]E[|n|] } 

and so \\h'l'\\^ ^ l/t^/s. With fl73H|) and Theorem Ol this gives that 
sup \E[h{Q\X.),...,Q^{X))]-E[hiZv)]\ 

heW(R™) 



sup \E[ht{Q\X),...,Q"'(X))] -E[ht{Zv)]\+2./^Vi 

?igW(R'") 



This function is minimised for t = / ^(-^i+^a) ^ yigi(ji]2g that 



2Vm 

sup \E[h{Q\-X),...,Q"\-X))]-E[h{Zv)]\ < ^{B^ + B^f^mi , 
as required. 

For Point [21 write 1^ = (Q^(X), . . . , (5'"(X)) for simplicity. For h e n{W^), we have 



E[h{BA^ X A~25' VT)] - E[h{BA^ x A'^E' Zy)]. 



Put g{x) = h{BA^x). Then, g E H(R.'^) and, thanks to the inequality (17.531) . we can 
write 

sup \E[h{W)] - E[h{Zv)]\ 

heW(R™) 



^ sup \E[g{A-^B'^W)] - E[g{A-^B^Zv) 



4 

< - 
3 



sup \E[gt{A^^B^W)]- E[gt{A^^2B^Zv] 

ge'HiM.'') 



2VkVt 



We may bound the partial derivatives of ft{x) = gt{A ^B^x) using the chain rule and 
(1732)1 . to give that 

ll/nU ^ 6Vtand||/ri|oo^&Vi. 

Using Theorem 17.21 and minimising the bound in t as before gives the assertion; the only 
changes are that Bi gets multiplied by 6^ and B2 gets multiplied by 6^. ■ 
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7.2 More universality 



Here, we prove a slightly stronger version of Theorem 11.31 stated in Section [T751 Precisely, 
we add the two conditions (2) and (3), making the criterion contained in Theorem 11.31 
more effective for potential applications. 

Theorem 7.5 Let G = {Gi : i ^ 1} be a standard centered i.i.d. Gaussian sequence, and 
fix integers m ^ 1 and di, dm ^ 2. For every j = 1, m, let {{Nn \ fn^) : n ^ 1} be 
a sequence such that {Nn^ : n ^ 1} is a sequence of integers going to infinity, and each 
function fn^ : [Nn^Y^ ^ M zs symmetric and vanishes on diagonals. Define Q^{n,G) = 
Qdj{Nn \ fn\G), n ^ 1, according to ( fi.ij) . Assume that, for every j = l,...m, the 
sequence E[Q^{n, G)^], n ^ 1, of variances is bounded. Let V be a m x m non-negative 
symmetric matrix, and let ^/^(O, V) be a centered Gaussian vector with covariance matrix 
V . Then, as n ^ oo, the following two conditions are equivalent. 

(1) The vector {Q^{n, G) : j = 1, ...,m} converges in law to yKn{0, V). 

(2) For alii, J = l,...,m, wehaveE[Q'{n,G)Q^{n,G)] V{iJ) and E[Q'{n,Gy] 
3V{i, i)"^ as n oo. 

(3) For alli,j = l,...,m, we have E[Q\n,G)Q^{n,G)~\ V{i,j) and, for all 1 ^ 
i ^ m and 1 ^ r ^ — 1, we have ||/^*^ -kr fn^\\2di-2r 0. 

(4) For every sequence X = {Xi : z ^ 1} of independent centered random variables, with 
unit variance and such that supj£'|Xjp < oo, the vector {Q^{n,X.) : j = l,...,m} 
converges in law to ./Kn{0, V) for the Kolmogorov distance. 

For the proof of Theorem 17.51 we need the following result, which consists in a collec- 
tion of some of the findings contained in the papers by Peccati and Tudor [31]. Strictly 
speaking, the original statements contained in [S^ only deal with positive definite covari- 
ance matrices: however, the extension to a non-negative matrix can be easily achieved by 
using the same arguments as in the "Step 3" of the proof of Theorem I7.2[ 

Theorem 7.6 Fix integers m ^ 1 and dm ^ ■ ■ ■ ^ di ^ 1. Let V = {V{i,j) : i,j = 
1, ...,m} be a m X m non-negative symmetric matrix. For any n ^ 1 and i = 1, . . . ,m, 
let belong to the dj*''' Gaussian chaos Cd^ • Assume that 

F^''^ = {Ft\ • • • , Fl-^) ■■= {hAhf\ • • • , h^{ht^)) n ^ 1, 

is such that 

lim ^] = V{i,j), l^i,3 ^m. (7.54) 

Then, as n oo, the following four assertions are equivalent: 
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(i) For every 1 ^ i ^ m, f/"'' converges in distribution to a centered Gaussian random 
variable with variance V{i,i). 



(a) For every 1 ^ i ^ m, E {Pj;"'^)^ 3V{i,i 



(Hi) For every 1 ^ i ^ m and every 1 ^ r ^ rfj — 1, \\h\^'' ^i"^||i3®(2<ii-2r) 0. 

(iv) The vector F^"^ converges in distribution to the d- dimensional Gaussian vector 
^(0,V^). 

Proof of Theorem 17. 5L The equivalences (1) ^ (2) ■<=^ (3) only consist in a reformu- 
lation of the previous Theorem I7.6[ by taking into account the equality (13.361) and the 
fact that (since we suppose that the sequence E[Q^ {n,GY] of variances is bounded, so 
that an hypercontractivity argument can be applied), if Point (1) is verified, then (17.541) 
holds. On the other hand, it is completely obvious that (4) implies (1), since G is a 
particular case of such an X. So, it remains to prove the implication (1), (2), (3) ^ (4). 
Let Zv = {Z\, . . . , Z™) ~ ^(0, V). We have 

sup |P[gi(n, X) ^ zi, . . . , g™(r2, X) ^ - P[4 ^ Zi, . . . , ^ z„] | ^ ^i'^) + ^^^^ 



with 



51'^) = sup |P[Qi(n,X)^zi,...,Q'"(n,X)^z^] 

-P[gi(n,G)^^i,...,Q'"(n,G)^2^]| 
= snp\P[Q\n,G)^zi,...,Q'^{n,G)^z^]-P[Zl,f^zi,...,Z^^z,n]\. 



By assumption (3), we have that Aij for all j = 1, . . . ,m (with Aij defined by 
(I7.50p ). Hence, Remark 17.41 (point 2) implies that 6n^ 0. By assumption (3) (for 
r = dj — 1) and (I1.14I) - (I1.15I) . we get that max o) Infj(/n"''') — as n ^ oo for all 

j = 1, . . . ,m. Hence, Corollary 17.31 implies that — > 0, which completes the proof. 



8 Some proofs based on Malliavin calculus and Stein's 
method 

8.1 The language of Malliavin calculus 

Let G = {Gi : z ^ 1} be an i.i.d. sequence of Gaussian random variables with zero 
mean and unit variance. In what follows, concerning the Wiener Gaussian chaos, we will 
systematically use the definitions and notation that have been introduced in Section O In 
particular, we shall encode the structure of random variables belonging to some Wiener 
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chaos by means of increasing (tensor) powers of a fixed real separable Hilbert space S). 
We recall that the first Wiener chaos of G (which coincides with the Gaussian space 
generated by G) is the L^-closed Hilbert space composed of random variables of the type 
Ii{h), where h & Sj. We shall denote by L'^{G) the space of all M-valued random elements 
F that are measurable with respect to cr{G} and verify < oo. Also, L'^{fl]9)) 

denotes the space of all i^-valued random elements u, that are measurable with respect 
to cr{G} and verify the relation < oo. 

We stress (see again [271 Ch. 1] or [11]) that any random variable F belonging to 
L^(G) admits the following chaotic expansion: 

oo 

F = E[F] + Y,Uhd), (8.55) 

d=l 

where the series converges in L'^{G) and the symmetric kernels G c? ^ 1, are 

uniquely determined by F. For convenience we also put 

Jo(/o) = E[F], 

so that we can write 

oo 

F = Y,iM. 

d=0 

In the particular case where = L'^{A, , fi), with {A,£/) a measurable space and /i a 
(T-finite and non-atomic measure, ^®'^ = L^A"^, ^Z^'^, fi^"^) is the space of symmetric and 
square integrable functions on A'^. 

Let =5^ be the set of all smooth cylindrical random variables of the form 

F = (7(/i(0i),...,/i(0„)) 

where n ^ 1, g : M" — M is a smooth function with compact support and & Sj. The 
Malliavin derivative of F (with respect to G) is the element of L'^{Q;Sj) defined as 

i=l 

By iteration, one can define the mth derivative D"^F (which is an element of L^(f2, i^®™)) 
for every m ^ 2. For m ^ 1, D'"'^ denotes the closure of ^ with respect to the norm 
II ■ ||m,2, defined by the relation 

\\Frm,2 = E[F']+j2^[\\D'F\M- 

i=l 

We have DIi{h) = h for every h E Sj. The Malliavin derivative D satisfies the following 
chain rule: if (7 : M" — ^> M is continuously differentiable and has bounded partial deriva- 
tives, and if {Fi, Fn) is a vector of elements of 3^'"^, then g{Fi,...,Fn) G D^'^ and 
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Dg{F,, . . . , F„) = ^ ^(Fi, . . . , FJDF,. (8.56) 

i=l * 

A careful application of the multiplication formula (12.281) shows that (18.561) continues to 
hold when {Fi, Fn) is a vector of multiple integrals (of possibly different orders) and g 
is a polynomial in n variables. We denote by S the adjoint of the operator D, also called 
the divergence operator. A random element u G L^{Q;Sj) belongs to the domain of 6, 
noted Dom(5, if and only if it satisfies 



\E{DF, u)^\^Cu VE[F^] for any F G ^, 

for some constant c„ depending only on u. If n G Dom6, then the random variable 6{u) 
is defined by the duality relationship (customarily called "integration by parts formula"): 

E{F6{u)) = E{DF, u)^, (8.57) 

which holds for every F G D^'^. If = L'^{A, ^ , fi) (with fi non-atomic), then the 
derivative of a random variable F = E{F) + Xld>i ^d{hd) can be identified with the 
element of L'^{A x Q) given by 

oo 

D,F = Y,dId-i{hd{;a)), a e A. (8.58) 

d=l 

Next we define the operator L, called the infinitesimal generator of the Ornstein-Ulilenbeck 
semigroup. It acts on square-integrable random variables as follows: a random variable 
F = E{F) + Ylid>i ^d{hd) is in the domain of L, noted DomL, if and only if Ylid>i ^^dihd) 
is convergent in L^, and in this case LF = — X]d>i '^^dihd). The operator L verifies the 
following crucial properties: (i) DomL = and (ii) a random variable F is in DomL 
if and only if F G Dom^D (i.e. F G D^'^ and DF G Dom5), and in this case one has that 
6{DF) = —LF. We also define the operator L~^, which is the pseudo-inverse of L, as 
follows: for every centered random variable F = X]d>i ^d{hd) of L^(G), we set L~^F = 
X]d>i ~\ld{hd). Note that is an operator with values in D^'^. 

Now consider a centered F G L^(G). By using the relations F = L{L~^F) = 
S{—DL~^F), one deduces from f l8.57p and (18.561) that, for every G G D^'^ and for ev- 
ery continuously differentiable : M — M with a bounded derivative, we have 

E[g{G)F] = E[g'{G){DG, -DL~^F)^]. (8.59) 

In particular, with g{x) = x, we get 

E[GF] = E[{DG, -DL-^F)^]. (8.60) 

Moreover, if F = /^(/i), with h G 9)®'^, then L'^F = -\F, and therefore f|g3^ and <!K^ 
become 

E[g{G)F] = -^E[g{G){DG,DF)^] and E[GF] = -^E[{DG, DF) (8.61) 
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Let h e ^®'^ with d^2, and let s ^ be an integer. The following identity is obtained 
by taking F = Id{h) and G = F^^^ in formula fl8.6ip . and then by applying (18.561) : 

ElUhy-''] = ■l±lE[Uhr\\DUh)\\l]. (8.62) 



8.2 Relations following from Stein's method 

Originally introduced in [ITl Stein's method can be described as a collection of prob- 
abilistic techniques, allowing to compute explicit bounds on the distance between the 
laws of random variables by means of differential operators. The reader is referred to [2] 
and [SH], and the references therein, for an introduction to these techniques. The follow- 
ing statement contains four bounds which can be obtained by means of a combination of 
Malliavin calculus and Stein's method. Points 1, 2 and 4 have been proved in [22], whereas 
the content of Point 3 is new. Our proof of such a bound gives an explicit example of 
the interaction between Stein's method and Malliavin calculus. We also introduce the 
following notation: for every centered F G D^'^, we write 

To(F) = v/Var((DF,-DL-iF)^), (8.63) 

where Var indicates variance (observe that, due to (18.601) . E{{DF, —DL^^F)^^ = E{F'^)). 
Note that. 



if F = /,(/), then ro(F) = 




(8.64) 



Proposition 8.1 Consider F = Id{h) with d ^ 1 and h G S)®*^, and let Z and Zy have 
respectively a ^(0, 1) and a ^{y) distribution (u ^ 1). If E{F'^) = 1 then the following 
three bounds hold: 

1. dTv{F,Z) ^2To{F). 

2. dw{F,Z)^To{F). 

3. For every thrice differentiable function <y9 : M ^ R such that ||v5"'|| < oo, 

\E[^{F)]~E[^{Z)]\^C,xT,{F), 

where is given in \3.34 )- 
If E{F'^) = 2v then the following bound is in order: 

I dBw{F, Z,) ^ max |y^, i + ^} y/E[{2u + 2F - {DF, -DL~^F)^y]. 
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Proof. Point 1 and 2 are proved in [221 Theorem 3.1], while Point 4 is proved in [22l 
Theorem 3.11]. To prove Point 3, fix ip as in the statement, and consider the Stein 
equation 



f\x)-xf\x) = ^{x)-E[ip{Z)l xe^. 
It is easily seen that a solution of f l8.65p is given by 

U{x) = e-'/' r {^{y) - E[^{Z)])e-y'"dy. 



According to the forthcoming Lemma 18.21 the following bound is in order: 



\x\ 



(8.65) 



(8.66) 



(8.67) 



where is given in fl3.33p . Now use (18.611) with g = and G = F, as well as a standard 
aproximation argument to take into account that is not necessarily bounded, in order 
to write 

\E[^{F)] - E[^{Z)]\ = \E[f(F) - FU{F)]\ 



= E 
€ K.E 



(1 



\Ff) 



By applying the Cauchy-Schwarz inequality four times, and by exploiting Proposition 12. 71 
one infers 



K.E 



(1 



|F|3) 



\ 



E 



1 - >f II?, 



C.T„{F), 



which is the desired conclusion. ■ 

Lemma 8.2 The function f^p defined in liS. 6b]] verifies relation (8.61). 

Proof. We denote by Z a Gaussian random variable with zero mean and unit variance. 
Observe that 



E\Z\ 



2^2 



and E\Z\ 



V2 



(8.68) 



We want to bound the quantity |/^(x)|, where: (i) (p is such that (p{x) = (p{0) + 
if' {0)x + ip" {0)x^/2 + R{x), with \R{x) \ ^ ||^'"||^ \xf /Q, and (ii) is given by (Km . 
and consequently 



/; (x) = ^ (x) - E^ (Z) + xU (x) = A{x) + B (x) 



(8.69) 
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with A (x) := ip (x) — Eip [Z] and B (x) := xf^p (x). It will become clear later on that 
our bounds on |/^ (x)| do not depend on the sign of x, so that in what follows we will 
uniquely focus on the case x > 0. Due to the assumptions on (yj, we have that 



A{x) 



-C 



^' (0) X + (0) x^ + i? (x) + C 
ax + hx^ + R (x) + C, where 



(note that the term (0) simplifies) and also, by using fl8.68p . 

I/' (0)1 , ll^"1Lv^ 



\c\ ^ 

' ' 2 3 
and (recall that x > 0) 



a 



\A{x)\ ^ |^'(0)|x+i|v."(0)|x^ + i||^'"|loo^' + C' 
= |a| X + |6| x^ + 7x^ + C" 



7 



6 



On the other hand, since EA (Z) = by construction, 

"+00 2 

A{y) e'^dy 



\B(x) 



xe 2 



< xe 2 



:= Fi (x) + Y2 (x) + F3 (x) + I4 (x) . 
We now evaluate the four terms Yi separately (observe that each of them is positive): 



Yi{x) 
Y2{x) 
Ysix) 



C xe 2 



xe 2 



xe 



+00 



e ^2 dy ^ C'e'2 



+00 



+00 



\a\ye 2 dy 



\a\ X 



+00 



ye "2 dy = C; 



+00 2 



|6| y^e 2 dy ^ e'2 
|6| (x2 + 2) =2\b\ + \b\x^; 

~dy = 'jx (x^ + 2) = •yx^ + 27X. 



xe 2 



, ..2 



By combining the above bounds with fl8.69p . one infers that 
|/;(x)| ^ 2C' + 2|6| +x(2|a| +27) +x22|6| +x327 

^ max {2C' + 2 |6| ; 2 \a\ + 27; 2 \b\ ; 27} x (l + x + x^ + x^) 
= max {2C' + 2 |6| ; 2 \a\ + 27} x (l + x + x^ + x=^) , 
which yields the desired conclusion. 
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8.3 Proof of Theorem SH 

Let F = Id{h), h G In view of Proposition \S.1\ it is sufficient to show that 

To(F)=Ti(F)^T2(F). 
Relation (3.42) in [22] yields that 

^\\DF\\l = E{F') + dJ2ir- 1)1 (^^~ J") l2d-2r (h^rh) , 

r=l ^ ^ 



^.70) 



which, by taking the orthogonality of multiple integrals of different orders into account, 
yields 



d-l 



r=l 



^'^(r- {2d-2r)\\\h(g)rh\\lmi<i-r), (8.71) 
and consequently To{F) = Ti{F). From the multiplication formula (12.281) ) we get 

F^ = J2r\('^) hd-2r{h®.,h). (8.72) 



r=0 



To conclude the proof, we use (18.621) with s = 2, combined with (18.701) and (18.721) . as well 
as the assumption that Ei^F"^) = 1, to get that 



E[F^]-3 = ^E{F^DF\\l)-3{d\\\h\\^ 

(Jj 



r - 1)! 



r=l 



J (2rf-2r)!||/i®,/i||J«2(.-.). 



Hence 



thus yielding the desired inequality Ti{F) ^ T2{F). 



8.4 Proof of Theorem ISTG 



Let F = Id{h), h G In view of Proposition [8?T] and since L ^F = —\F, it is sufficient 
to show that 



E 



2u + 2F -hwFWi 
d ^ 



T^{F) ^ T,{F). 
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By taking into account the orthogonality of multiple integrals of different orders, relation 
(IHTD|) yields 



E 



2iy + 2F -^\\DF\\l 



4d\ 



A{d/2)\ 



4 



+ 



r=l,...,d-l ^ ^ 



r"-||^(gl(2d-2r) ! 



and consequently T3(F) 



On the other hand, by com- 



[2v + 2F-\\\DF\\iy 

bining fl8.62p (for s = 1 and s = 2) with fl8.72p . we get, still by taking into account the 
orthogonality of multiple integrals of different orders: 



E[F^] - 12E[F^ 



12z/2 - 48i/ + 24c/! 



lh®d/2h 



+ 3d Yl 



r! (r — 1)\ 



r=l,...,d-l 



A{d/2)\ 



(2d - 2r)! \\h®rh\ 



^®(2d-2r) 



It is now immediate to deduce that T^i^F) ^ Ti{F). 
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